Probability of Error for Optimal Codes
in a Gaussian Channel 


By CLAUDE E. SHANNON 
(Manuscript received October 17, 1958) 


A study is made of coding and decoding systems for a continuous channel 
with an additive gaussian noise and subject to an average power limitation 
at the transmitter. Upper and lower bounds are found for the error prob- 
ability in decoding with optimal codes and decoding systems. These bounds 
are close together for signaling rates near channel capacity and also for sig- 
naling rates near zero, but diverge between. Curves exhibiting these bounds 
are given. 


I. INTRODUCTION 


Consider a communication channel of the following type: Once each 
second a real number may be chosen at the transmitting point. This 
number is transmitted to the receiving point but is perturbed by an 
additive gaussian noise, so that the ith real number, $s_i$, is received as
$s_i + x_i$. The $x_i$ are assumed independent gaussian random variables all
with the same variance N. 

A code word of length n for such a channel is a sequence of n real 
numbers $(s_1, s_2, \cdots, s_n)$. This may be thought of geometrically as a
point in n-dimensional Euclidean space. The effect of noise is then to 
move this point to a nearby point according to a spherical gaussian 
distribution. 

A block code of length $n$ with $M$ words is a mapping of the integers
$1, 2, \cdots, M$ into a set of $M$ code words $w_1, w_2, \cdots, w_M$ (not
necessarily all distinct). Thus, geometrically, a block code consists of a
collection of $M$ (or fewer) points with associated integers. It may be
thought of as a way of transmitting an integer from 1 to $M$ to the
receiving point (by sending the corresponding code word). A decoding system
for such a code is a partitioning of the n-dimensional space into $M$
subsets corresponding to the integers from 1 to $M$. This is a way of
deciding, at the receiving point, on the transmitted integer. If the
received signal is in subset $S_i$, the transmitted message is taken to be
integer $i$.

We shall assume throughout that all integers from 1 to $M$ occur as
messages with equal probability $1/M$. There is, then, for a given code
and decoding system, a definite probability of error for transmitting a
message. This is given by

$$P_e = \frac{1}{M} \sum_{i=1}^{M} P_{ei},$$

where $P_{ei}$ is the probability, if code word $w_i$ is sent, that it will
be decoded as an integer other than $i$. $P_{ei}$ is, of course, the total
probability under the gaussian distribution, centered on $w_i$, in the
region complementary to $S_i$.

An optimal decoding system for a code is one which minimizes the 
probability of error for the code. Since the gaussian density is monotone 
decreasing with distance, an optimal decoding system for a given code 
is one which decodes any received signal as the integer corresponding 
to the geometrically nearest code word. If there are several code words 
at the same minimal distance, any of these may be used without affecting
the probability of error. A decoding system of this sort is called minimum
distance decoding or maximum likelihood decoding. It results in a
partitioning of the n-dimensional space into n-dimensional polyhedra,
or polytopes, around the different signal points, each polyhedron bounded
by a finite number (not more than $M - 1$) of $(n - 1)$-dimensional
hyperplanes.
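
The decoding rule and the error probability $P_e$ described above can be sketched numerically. The following is a minimal illustration, not part of the original analysis: the function names, the two-word antipodal code and the trial count are all assumptions chosen for the example, and the error probability is estimated by simulation rather than computed exactly.

```python
import math
import random


def nearest_codeword(received, codewords):
    """Minimum distance decoding: index of the code word closest to `received`."""
    best_index, best_dist = 0, float("inf")
    for index, word in enumerate(codewords):
        dist = sum((r - w) ** 2 for r, w in zip(received, word))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def estimate_error_probability(codewords, noise_variance, trials, rng):
    """Monte Carlo estimate of P_e with equally likely messages."""
    sigma = math.sqrt(noise_variance)
    errors = 0
    for _ in range(trials):
        sent = rng.randrange(len(codewords))
        # Each coordinate is perturbed by independent gaussian noise of variance N.
        received = [s + rng.gauss(0.0, sigma) for s in codewords[sent]]
        if nearest_codeword(received, codewords) != sent:
            errors += 1
    return errors / trials


# Hypothetical example: n = 2, M = 2, P = 1, so both words lie at distance sqrt(nP).
code = [(1.0, 1.0), (-1.0, -1.0)]
rng = random.Random(0)
p_e = estimate_error_probability(code, noise_variance=1.0, trials=20000, rng=rng)
print(f"estimated P_e = {p_e:.4f}")
```

For this antipodal pair the two decoding regions are half-spaces, so the estimate can be compared with the normal tail probability at $\sqrt{2}$ standard deviations.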

We are interested in the problem of finding good codes, that is, placing
$M$ points in such a way as to minimize the probability of error $P_e$.
If there were no conditions on the code words, it is evident that the 
probability of error could be made as small as desired for any M, n and 
N by placing the code words at sufficiently widely separated points in 
the n space. In normal applications, however, there will be limitations 
on the choice of code words that prevent this type of solution. An inter- 
esting case that has been considered in the past is that of placing some 
kind of average power limitation on the code words; the distance of the 
points from the origin should not be too great. We may define three 
different possible limitations of this sort: 

i. All code words are required to have exactly the same power $P$ or the
same distance from the origin. Thus, we are required to choose for code
words points lying on the surface of a sphere of radius $\sqrt{nP}$.

ii. All code words have power $P$ or less. Here all code words are required
to lie interior to or on the surface of a sphere of radius $\sqrt{nP}$.

iii. The average power of all code words is P or less. Here, individual 
code words may have a greater squared distance than nP but the aver- 
age of the set of squared distances cannot exceed nP. 

These three cases lead to quite similar results, as we shall see. The
first condition is simpler and leads to somewhat sharper conclusions;
we shall first analyze this case and use these results for the other two
conditions. Therefore, until the contrary is stated, we assume all code words
to lie on the sphere of radius $\sqrt{nP}$.

Our first problem is to estimate, as well as possible, the probability
of error $P_e(M, n, P/N)$ for the best code of length $n$ containing $M$
words each of power $P$ and perturbed by noise of variance $N$. This minimal
or optimal probability of error we denote by
$P_{e\,\mathrm{opt}}(M, n, \sqrt{P/N})$. It is clear that, for fixed $M$, $n$,
$P_{e\,\mathrm{opt}}$ will be a function only of the quotient
$A = \sqrt{P/N}$ by change of scale in the geometrical picture. We shall
obtain upper and lower bounds on $P_{e\,\mathrm{opt}}$ of several different
types. Over an important range of values these bounds are reasonably close
together, giving good estimates of $P_{e\,\mathrm{opt}}$. Some calculated
values and curves are given and the bounds are used to develop other bounds
for the second and third type conditions on the code words.

The geometrical approach we use is akin to that previously used by 
the author’ but carried here to a numerical conclusion. The problem is 
also close to that studied by Rice,’ who obtained an estimate similar to 
but not as sharp as one of our upper bounds. The work here is also 
analogous to bounds given by Elias’ for the binary symmetric and binary 
erasure channels, and related to bounds for the general discrete memory- 
less channel given by the author.’ 

In a general way, our bounds, both upper and lower, vary exponentially
with $n$ for a fixed signaling rate, $R$, and fixed $P/N$. In fact, they
all can be put [letting $R = (1/n) \log M$, so that $R$ is the transmitting
rate for the code] in the form

$$e^{-E(R)n + o(n)}, \tag{1}$$

where $E(R)$ is a suitable function of $R$ (and of $P/N$, which we think
of as a fixed parameter). [In (1), $o(n)$ is a term of order less than $n$; as
$n \to \infty$ it becomes small relative to $E(R)n$.]

Thus, for large $n$, the logarithm of the bound increases linearly with
$n$ or, more precisely, the ratio of this logarithm to $n$ approaches a
constant $E(R)$. This quantity $E(R)$ gives a crude measure of how rapidly
the probability of error approaches zero. We will call this type of
quantity a reliability. More precisely, we may define the reliability for a
channel as follows:

$$E(R) = \limsup_{n \to \infty}\; -\frac{1}{n} \log P_{e\,\mathrm{opt}}(R, n), \tag{2}$$

where $P_{e\,\mathrm{opt}}(R, n)$ is the optimal probability of error for
codes of rate $R$ and length $n$. We will find that our bounds determine
$E(R)$ exactly over an important range of rates, from a certain critical
rate $R_c$ up to channel capacity. Between zero and $R_c$, $E$ is not exactly
determined by our bounds, but lies within a not too wide range.

In connection with the reliability $E$, it may be noted that, in (1)
above, knowledge of $E(R)$ and $n$ does not closely determine the
probability of error, even when $n$ is large; the term $o(n)$ can cause a
large and, in fact, increasing multiplier. On the other hand, given a desired
probability of error and $E(R)$, the necessary value of the code length $n$
will be sharply determined when $n$ is large; in fact, $n$ will be asymptotic
to $-(1/E) \log P_e$. This inverse problem is perhaps the more natural
one in applications: given a required level of probability of error, how
long must the code be?
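
The inverse problem just described can be illustrated with a short calculation. This is only a sketch of the asymptotic relation $n \sim -(1/E) \log P_e$; the function name and the numerical values of the reliability and target error are hypothetical, and for moderate $n$ the neglected $o(n)$ term may change the answer noticeably.

```python
import math


def required_block_length(target_error, reliability):
    """Asymptotic code length estimate n ~ -(1/E) log P_e (natural logarithm)."""
    if not 0.0 < target_error < 1.0:
        raise ValueError("target_error must lie strictly between 0 and 1")
    if reliability <= 0.0:
        raise ValueError("reliability must be positive")
    return math.ceil(-math.log(target_error) / reliability)


# Hypothetical values: reliability E(R) = 0.1 and a target P_e of one in a million.
print(required_block_length(1e-6, 0.1))
```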

The type of channel we are studying here is, of course, closely related 
to a band-limited channel (W cycles per second wide) perturbed by 
white gaussian noise. In a sense, such a band-limited channel can be 
thought of as having 2W coordinates per second, each independently 
perturbed by a gaussian variable. However, such an identification must 
be treated with care, since to control these degrees of freedom physically 
and stay strictly within the bandwidth would require an infinite delay. 

It is possible to stay very closely within a bandwidth $W$ with a large
but finite delay $T$, for example, by using $(\sin x)/x$ pulses with one tail
deleted $T$ from the maximum point. This deletion causes a spill-over
outside the band of not more than the energy of the deleted part, an
amount less than $1/T$ for the unit $(\sin x)/x$ case. By making $T$ large,
we can approach the situation of staying within the allotted bandwidth 
and also, for example, approach zero probability of error at signaling 
rates close to channel capacity. 


However, for the problems we are studying here, delay as related to 
probability of error is of fundamental importance and, in applications 
of our results to such band-limited channels, the additional delay in- 
volved in staying closely within the allotted channel must be remem- 
bered. This is the reason for defining the channel as we have above. 


II. SUMMARY


In this section we summarize briefly the main results obtained in the
paper, both for easy reference and for readers who may be interested
in the results without wishing to work through the detailed analysis. It
might be said that the algebra involved is in several places unusually
tedious.

We use the following notations:

$P$ = signal power (each code word is on the surface of a sphere of
radius $\sqrt{nP}$);

$N$ = noise power (variance $N$ in each dimension);

$A$ = $\sqrt{P/N}$ = signal-to-noise "amplitude" ratio;

$n$ = number of dimensions or block length of code;

$M$ = number of code words;

$R$ = $(1/n) \log M$ = signaling rate for a code (natural units);

$C$ = $\frac{1}{2} \log (P + N)/N = \frac{1}{2} \log (A^2 + 1)$ = channel
capacity (per degree of freedom);

$\theta$ = variable for half-angle of cones appearing in the geometrical
problem which follows;

$\Omega(\theta)$ = solid angle in n space of a cone of half-angle $\theta$,
or area of unit n sphere cut out by the cone;

$\theta_0$ = $\cot^{-1} A$ = cone angle relating to channel capacity;

$\theta_1$ = cone angle such that the solid angle $\Omega(\theta_1)$ of this
cone is $(1/M)\Omega(\pi)$, [the solid angle of a sphere is $\Omega(\pi)$];
thus, $\theta_1$ is a cone angle related to the rate $R$;

$G$ = $G(\theta) = \frac{1}{2}(A \cos\theta + \sqrt{A^2 \cos^2\theta + 4})$,
a quantity which appears often in the formulas;

$\theta_c$ = the solution of $2\cos\theta_c - AG(\theta_c)\sin^2\theta_c = 0$
(this critical angle is important in that the nature of the bounds changes
according as $\theta_1 > \theta_c$ or $\theta_1 < \theta_c$);

$Q(\theta)$ = $Q(\theta, A, n)$ = probability of a point $X$ in n space,
at distance $A\sqrt{n}$ from the origin, being moved outside a circular cone
of half-angle $\theta$ with vertex at the origin $O$ and axis $OX$ (the
perturbation is assumed spherical gaussian with unit variance in all
dimensions);

$E_L(\theta)$ = $A^2/2 - \frac{1}{2} AG\cos\theta - \log(G\sin\theta)$, an
exponent appearing in our bounds;

$P_{e\,\mathrm{opt}}(n, R, A)$ = probability of error for the best code of
length $n$, signal-to-noise ratio $A$ and rate $R$;

$\Phi(X)$ = normal distribution with zero mean and unit variance.
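
The quantities in this list are straightforward to evaluate numerically. The sketch below is an illustration only: the function names are our own, and $\theta_c$ is located by bisection on the interval between $\theta_0$ and $\pi/2$, where the defining expression changes sign for $A > 0$. It also checks two facts used later in the paper, namely that $G(\theta_0)\sin\theta_0 = 1$ and $E_L(\theta_0) = 0$.

```python
import math


def capacity(A):
    """C = (1/2) log(A^2 + 1), in natural units per degree of freedom."""
    return 0.5 * math.log(A * A + 1.0)


def theta_0(A):
    """theta_0 = arccot(A), taken in (0, pi/2) for A > 0."""
    return math.atan2(1.0, A)


def G(theta, A):
    c = A * math.cos(theta)
    return 0.5 * (c + math.sqrt(c * c + 4.0))


def E_L(theta, A):
    g = G(theta, A)
    return A * A / 2.0 - 0.5 * A * g * math.cos(theta) - math.log(g * math.sin(theta))


def theta_c(A, tol=1e-12):
    """A root of 2 cos(t) - A G(t) sin^2(t) = 0 between theta_0 and pi/2 (bisection)."""
    def f(t):
        return 2.0 * math.cos(t) - A * G(t, A) * math.sin(t) ** 2

    lo, hi = theta_0(A), math.pi / 2.0  # f(lo) = cos(theta_0) > 0, f(hi) = -A < 0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


A = 2.0  # hypothetical signal-to-noise amplitude ratio
print(capacity(A), theta_0(A), theta_c(A), E_L(theta_c(A), A))
```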
The results of the paper will now be summarized. $P_{e\,\mathrm{opt}}$ can be
bounded as follows:

$$Q(\theta_1) \le P_{e\,\mathrm{opt}} \le Q(\theta_1) - \int_0^{\theta_1} \frac{\Omega(\theta)}{\Omega(\theta_1)}\, dQ(\theta). \tag{3}$$

[Here $dQ(\theta)$ is negative, so the right additional term is positive.] These
bounds can be written in terms of rather complex integrals. To obtain
more insight into their behavior, we obtain, in the first place, asymptotic
expressions for these bounds when $n$ is large and, in the second place,
cruder bounds which, however, are expressed in terms of elementary
functions without integrals.
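The asymptotic lower bound is (asymptotically correct as $n \to \infty$)

$$Q(\theta_1) \sim \frac{1}{\sqrt{n\pi}\,\sqrt{1 + G^2}\,\sin\theta_1\,(AG\sin^2\theta_1 - \cos\theta_1)}\, e^{-E_L(\theta_1) n} = \frac{\alpha(\theta_1)}{\sqrt{n}}\, e^{-E_L(\theta_1) n}, \qquad (\theta_1 > \theta_0). \tag{4}$$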

The asymptotic upper bound is

$$Q(\theta_1) - \int_0^{\theta_1} \frac{\Omega(\theta)}{\Omega(\theta_1)}\, dQ(\theta) \sim \frac{\alpha(\theta_1)}{\sqrt{n}}\, e^{-E_L(\theta_1) n} \left(1 - \frac{\cos\theta_1 - AG\sin^2\theta_1}{2\cos\theta_1 - AG\sin^2\theta_1}\right). \tag{5}$$

This formula is valid for $\theta_0 < \theta_1 < \theta_c$. In this range the
upper and lower asymptotic bounds differ only by the factor in parentheses
independent of $n$. Thus, asymptotically, the probability of error is
determined by these relations to within a multiplying factor depending on
the rate. For rates near channel capacity ($\theta_1$ near $\theta_0$) the
factor is just a little over unity; the bounds are close together. For lower
rates near $R_c$ (corresponding to $\theta_c$), the factor becomes large. For
$\theta_1 > \theta_c$ the upper bound asymptote is


| 
cos 6, sin’ 0.G(0,.) Wark” (6.)[1 + G(6.)|? 


n{E (6, 





(6) 


In addition to the asymptotic bound, we also obtain firm bounds, 
valid for all n, but poorer than the asymptotic bounds when n is large. 
The firm lower bound is 


> > I Vn —s e a. e Fun 
~ On(A + 1)8etHrn 

It may be seen that this is equal to the asymptotic bound multiplied by
a factor essentially independent of $n$. The firm upper bound {valid if
the maximum of $G^n (\sin\theta)^{2n-3} \exp[-(n/2)(A^2 - AG\cos\theta)]$ in
the range 0 to $\theta_1$ occurs at $\theta_1$} is


P; opt < A:0/ 2n e *G" (0;) sin 6," . exp k ( ~A° + AG COS a)| 


l 
nd, min [A, AG(@,) sin 6 — cot 4)) 


For rates near channel capacity, the upper and lower asymptotic 


bounds are both approximately the same, giving, where n is large and 
C — R small (but positive): 


,) in, / / 
Fs opt —™ I | v n V N(P +. DN) (I 


where ® is the normal distribution with unit variance. 


/2P(P + N) “i | 


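To relate the angle $\theta_1$ in the above formulas to the rate $R$,
inequalities are found:

$$\frac{\Gamma\!\left(\frac{n}{2} + 1\right)(\sin\theta_1)^{n-1}}{n\,\Gamma\!\left(\frac{n+1}{2}\right)\pi^{1/2}\cos\theta_1}\left(1 - \frac{1}{n}\tan^2\theta_1\right) \le e^{-nR} \le \frac{\Gamma\!\left(\frac{n}{2} + 1\right)(\sin\theta_1)^{n-1}}{n\,\Gamma\!\left(\frac{n+1}{2}\right)\pi^{1/2}\cos\theta_1}. \tag{10}$$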


Asymptotically, it follows that:

$$e^{-nR} \sim \frac{\sin^n\theta_1}{\sqrt{2\pi n}\,\sin\theta_1\cos\theta_1}. \tag{11}$$

For low rates (particularly $R < R_c$), the above bounds diverge and
give less information. Two different arguments lead to other bounds
useful at low rates. The low rate upper bound is:

$$P_{e\,\mathrm{opt}} \le \frac{1}{\lambda A\sqrt{\pi n}}\, e^{-nA^2\lambda^2/4}, \tag{12}$$

where $\lambda$ satisfies $R = [1 - (1/n)] \log(\sin 2\sin^{-1} \lambda/\sqrt{2})$.
Note that as $R \to 0$, $\lambda \to 1$ and the upper bound is approximately

$$\frac{1}{A\sqrt{\pi n}}\, e^{-nA^2/4}.$$

The low rate lower bound may be written 


: —- 2M n\"” . 
P. opt = 5 2M oe i *) | (13) 


For M large, this bound is close to 4@(—A+Wn/2) and, if n is large, 


nA2/4 


this is asymptotic to 1/(A+/an) & "*"'*. Thus, for rates close to zero and 
large n we again have a situation where the bounds are close together 
and give a sharp evaluation of P, opt . 

With codes of rate $R \ge C + \epsilon$, where $\epsilon$ is fixed and
positive, $P_{e\,\mathrm{opt}}$ approaches unity as the code length $n$
increases.


III. THE LOWER BOUND BY THE "SPHERE-PACKING" ARGUMENT

Suppose we have a code with $M$ points each at distance $\sqrt{nP}$ from
the origin in n space. Since any two words are at equal distance from
the origin, the $(n - 1)$-dimensional hyperplane which bisects the
connecting line passes through the origin. Thus, all of the hyperplanes
which determine the polyhedra surrounding these points (for the optimal
decoding system) pass through the origin. These polyhedra, therefore, are
pyramids with apexes at the origin. The probability of error for the code is

$$\frac{1}{M} \sum_{i=1}^{M} P_{ei},$$

where $P_{ei}$ is the probability, if code word $i$ is used, that it will be
carried by the noise outside the pyramid around the $i$th word. The
probability of being correct is

$$1 - \frac{1}{M} \sum_{i=1}^{M} P_{ei} = \frac{1}{M} \sum_{i=1}^{M} (1 - P_{ei}),$$

that is, the average probability of a code word being moved to a point
within its own pyramid.

Let the $i$th pyramid have a solid angle $\Omega_i$ (that is, $\Omega_i$ is
the area cut out by the pyramid on the unit n-dimensional spherical
surface). Consider, for comparison, a right circular n-dimensional cone with
the same solid angle $\Omega_i$ and having a code word on its axis at
distance $\sqrt{nP}$. We assert that the probability of this comparison
point being moved to within its cone is greater than that of $w_i$ being
moved to within its pyramid. This is because of the monotone decreasing
probability density with distance from the code word. The pyramid can be
deformed into the cone by moving small conical elements from far distances
to nearer distances, this movement continually increasing probability. This
is suggested for a three-dimensional case in Fig. 1. Moving small conical
elements from outside the cone to inside it increases probability, since the
probability density is greater inside the cone than outside. Formally, this
follows by integrating the probability density over the region $R_1$ in the
cone but not in the pyramid, and in the region $R_2$ in the pyramid but not
in the cone. The first is greater than the solid angle $\Omega$ of $R_1$
times the density at the edge of the cone. The value for the pyramid is less
than the same quantity.

Fig. 1 - Pyramid deformed into cone by moving small conical elements from
far to nearer distances.

We have, then, a bound on the probability of error $P_e$ for a given
code:

$$P_e \ge \frac{1}{M} \sum_{i=1}^{M} Q^*(\Omega_i), \tag{14}$$

where $\Omega_i$ is the solid angle for the $i$th pyramid, and
$Q^*(\Omega)$ is the probability of a point being carried outside a
surrounding cone of solid angle $\Omega$. It is also true that

$$\sum_{i=1}^{M} \Omega_i = \Omega_0,$$

the solid angle of an n sphere, since the original pyramids corresponded
to a partitioning of the sphere. Now, using again the property that the
density decreases with distance, it follows that $Q^*(\Omega)$ is a convex
function of $\Omega$. Then we may further simplify this bound by replacing
each $\Omega_i$ by the average $\Omega_0/M$. In fact,

$$\frac{1}{M} \sum_{i=1}^{M} Q^*(\Omega_i) \ge Q^*\!\left(\frac{\Omega_0}{M}\right),$$

and hence

$$P_e \ge Q^*\!\left(\frac{\Omega_0}{M}\right).$$

It is more convenient to work in terms of the half-cone angle $\theta$ rather
than solid angles $\Omega$. We define $Q(\theta)$ to be the probability of
being carried outside a cone of half-angle $\theta$. Then, if $\theta_1$
corresponds to the cone of solid angle $\Omega_0/M$, the bound above may be
written

$$P_e \ge Q(\theta_1). \tag{15}$$


This is our fundamental lower bound for $P_e$. It still needs translation
into terms of $P$, $N$, $M$ and $n$, and estimation in terms of simple
functions.

It may be noted that this bound is exactly the probability of error
that would occur if it were possible to subdivide the space into $M$
congruent cones, one for each code word, and place the code words on the
axes of these cones. It is, of course, very plausible intuitively that any
actual code would have a higher probability of error than would that
with such a conical partitioning. Such a partitioning clearly is possible
only for $n = 1$ or 2, if $M > 2$.

The lower bound $Q(\theta_1)$ can be evaluated in terms of a distribution
familiar to statisticians as the noncentral t-distribution.” The noncentral
t may be thought of as the probability that the ratio of a random variable
$(z + \delta)$ to the root mean square of $f$ other random variables

$$\sqrt{\frac{1}{f} \sum_{i=1}^{f} x_i^2}$$

does not exceed $t$, where all variates $x_i$ and $z$ are gaussian and
independent with mean zero and unit variance and $\delta$ is a constant.
Thus, denoting it by $P(f, \delta, t)$, we have

$$P(f, \delta, t) = \Pr\left\{ \frac{z + \delta}{\sqrt{\frac{1}{f} \sum_{i=1}^{f} x_i^2}} \le t \right\}. \tag{16}$$

In terms of our geometrical picture, this amounts to a spherical gaussian
distribution with unit variance about a point $\delta$ from the origin in
$f + 1$ space. The probability $P(f, \delta, t)$ is the probability of being
outside a cone from the origin having the line segment to the center of the
distribution as axis. The cotangent of the half-cone angle $\theta$ is
$t/\sqrt{f}$. Thus the probability $Q(\theta)$ is seen to be given by

$$Q(\theta) = P\!\left(n - 1,\ \sqrt{\frac{nP}{N}},\ \sqrt{n - 1}\,\cot\theta\right). \tag{17}$$

The noncentral t-distribution does not appear to have been very extensively
tabled. Johnson and Welch’ give some tables, but they are aimed
at other types of application and are inconvenient for the purpose at
hand. Further, they do not go to large values of $n$. We therefore will
estimate this lower bound by developing an asymptotic formula for the
cumulative distribution $Q(\theta)$ and also the density distribution
$dQ/d\theta$. First, however, we will find an upper bound on
$P_{e\,\mathrm{opt}}$ in terms of the same distribution $Q(\theta)$.
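
For small $n$ the probability $Q(\theta)$ can be checked by direct simulation of the geometrical picture: a point at distance $A\sqrt{n}$ on the cone axis is perturbed by unit-variance gaussian noise and its angle from the axis is compared with $\theta$. The following sketch is only an illustration under those assumptions; the function name, the sample sizes and the parameter values are hypothetical, and a Monte Carlo estimate is of no use for the very small probabilities that arise at large $n$.

```python
import math
import random


def estimate_Q(theta, A, n, trials, rng):
    """Monte Carlo estimate of Q(theta, A, n): probability of leaving the cone."""
    outside = 0
    for _ in range(trials):
        # Coordinate along the axis OX: signal amplitude plus one noise component.
        axial = A * math.sqrt(n) + rng.gauss(0.0, 1.0)
        # Squared length of the n - 1 noise components perpendicular to the axis.
        perp_sq = sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(n - 1))
        angle = math.atan2(math.sqrt(perp_sq), axial)  # angle from the axis, in [0, pi]
        if angle > theta:
            outside += 1
    return outside / trials


rng = random.Random(1)
q_small = estimate_Q(0.2, A=1.0, n=8, trials=5000, rng=rng)
q_large = estimate_Q(2.5, A=1.0, n=8, trials=5000, rng=rng)
print(q_small, q_large)
```

The event counted here, angle greater than $\theta$, is the same as the ratio in (16) not exceeding $t = \sqrt{n-1}\cot\theta$ with $f = n - 1$ and $\delta = A\sqrt{n}$.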


IV. UPPER BOUND BY A RANDOM CODE METHOD 


The upper bound for $P_{e\,\mathrm{opt}}$ will be found by using an argument
based on random codes. Consider the ensemble of codes obtained by placing
$M$ points randomly on the surface of a sphere of radius $\sqrt{nP}$. More
precisely, each point is placed independently of all others with
probability measure proportional to surface area or, equivalently, to solid
angle. Each of the codes in the ensemble is to be decoded by the minimum
distance process. We wish to compute the average probability of error for
this ensemble of codes.

Because of the symmetry of the code points, the probability of error
averaged over the ensemble will be equal to $M$ times the average
probability of error due to any particular code point, for example, code
point 1. This may be computed as follows. The probability of message number
1 being transmitted is $1/M$. The differential probability that it will
be displaced by the noise into the region between a cone of half-angle
$\theta$ and one of half-angle $\theta + d\theta$ (these cones having vertex
at the origin and axis out to code word 1) is $-dQ(\theta)$. [Recall that
$Q(\theta)$ was defined as the probability that noise would carry a point
outside the cone of angle $\theta$ with axis through the signal point.] Now
consider the cone of half-angle $\theta$ surrounding such a received point
(not the cone about the message point just described). If this cone is empty
of signal points, the received word will be decoded correctly as message 1.
If it is not empty, other points will be nearer and the received signal will
be incorrectly decoded. (The probability of two or more points at exactly
the same distance is readily seen to be zero and may be ignored.)

The probability in the ensemble of codes of the cone of half-angle $\theta$
being empty is easily calculated. The probability that any particular
code word, say code word 2 or code word 3, etc., is in the cone is given
by $\Omega(\theta)/\Omega(\pi)$, the ratio of the solid angle in the cone to
the total solid angle. The probability a particular word is not in the cone
is $1 - \Omega(\theta)/\Omega(\pi)$. The probability that all $M - 1$ other
words are not in the cone is $[1 - \Omega(\theta)/\Omega(\pi)]^{M-1}$ since
these are, in the ensemble of codes, placed independently. The probability
of error, then, contributed by situations where the point 1 is displaced by
an angle from $\theta$ to $\theta + d\theta$ is given by
$-(1/M)\{1 - [1 - \Omega(\theta)/\Omega(\pi)]^{M-1}\}\,dQ(\theta)$. The total
average probability of error for all code words and all noise displacements
is then given by

$$P_{er} = -\int_{\theta=0}^{\pi} \left\{ 1 - \left[ 1 - \frac{\Omega(\theta)}{\Omega(\pi)} \right]^{M-1} \right\} dQ(\theta). \tag{18}$$

This is an exact formula for the average probability of error $P_{er}$ for
our random ensemble of codes. Since this is an average of $P_e$ for
particular codes, there must exist particular codes in the ensemble with at
least this good a probability of error, and certainly then
$P_{e\,\mathrm{opt}} \le P_{er}$.

We may weaken this bound slightly but obtain a simpler formula for
calculation as follows. Note first that
$\{1 - [1 - \Omega(\theta)/\Omega(\pi)]^{M-1}\} \le 1$ and also, using the
well-known inequality $(1 - x)^n \ge 1 - nx$, we have
$\{1 - [1 - \Omega(\theta)/\Omega(\pi)]^{M-1}\} \le (M - 1)[\Omega(\theta)/\Omega(\pi)] \le M[\Omega(\theta)/\Omega(\pi)]$.
Now, break the integral into two parts, $0 \le \theta \le \theta_1$ and
$\theta_1 \le \theta \le \pi$. In the first range, use the inequality just
given and, in the second range, bound the expression in braces by 1. Thus,

$$P_{er} \le -\int_0^{\theta_1} M\, \frac{\Omega(\theta)}{\Omega(\pi)}\, dQ(\theta) - \int_{\theta_1}^{\pi} dQ(\theta),$$

$$P_{er} \le -\frac{M}{\Omega(\pi)} \int_0^{\theta_1} \Omega(\theta)\, dQ(\theta) + Q(\theta_1). \tag{19}$$

It is convenient to choose for $\theta_1$ the same value as appeared in the
lower bound; that is, the $\theta_1$ such that
$\Omega(\theta_1)/\Omega(\pi) = 1/M$; in other words, the $\theta_1$ for
which one expects one point within the $\theta_1$ cone. The second term in
(19) is then the same as the lower bound on $P_{e\,\mathrm{opt}}$ obtained
previously. In fact, collecting these results, we have

$$Q(\theta_1) \le P_{e\,\mathrm{opt}} \le Q(\theta_1) - \frac{1}{\Omega(\theta_1)} \int_0^{\theta_1} \Omega(\theta)\, dQ(\theta), \tag{20}$$

where $M\Omega(\theta_1) = \Omega(\pi)$. These are our fundamental lower and
upper bounds on $P_{e\,\mathrm{opt}}$.

We now wish to evaluate and estimate $\Omega(\theta)$ and $Q(\theta)$.
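
The weakening step can be seen numerically. In the sketch below, `fraction` stands for $\Omega(\theta)/\Omega(\pi)$; the function names and the sample values are illustrative assumptions. The first function is the exact probability that at least one of the $M - 1$ other words falls in the cone, and the second is the simpler quantity that replaces it in the two ranges of the integral.

```python
def at_least_one_in_cone(fraction, num_words):
    """Exact probability 1 - (1 - x)^(M - 1) that the cone is not empty."""
    return 1.0 - (1.0 - fraction) ** (num_words - 1)


def weakened_bound(fraction, num_words):
    """Simpler bound: M x where that is below 1, and 1 otherwise."""
    return min(1.0, num_words * fraction)


M = 1000  # hypothetical number of code words
for fraction in (1e-5, 1e-4, 1e-3, 1e-2, 0.5):
    exact = at_least_one_in_cone(fraction, M)
    bound = weakened_bound(fraction, M)
    print(f"x = {fraction:g}: exact = {exact:.6f}, bound = {bound:.6f}")
```

The two expressions agree closely when $Mx$ is small and both are near 1 when $Mx$ is large; the loss is concentrated near $x = 1/M$, which is where the integral is split.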


V. FORMULAS FOR RATE R AS A FUNCTION OF THE CONE ANGLE $\theta$


Our bounds on probability of error involve the cone angle $\theta_1$ such
that the solid angle of the cone is $1/M = e^{-nR}$ times the full solid
angle of a sphere. To relate these quantities more explicitly we calculate
the solid angle of a cone in $n$ dimensions with half-angle $\theta$. In
Fig. 2 this means calculating the $(n - 1)$-dimensional area of the cap cut
out by the cone on the unit sphere. This is obtained by summing the
contributions due to ring-shaped elements of area (spherical surfaces in
$n - 1$ dimensions of radius $\sin\theta$ and of incremental width
$d\theta$). Thus, the total area of the cap is given by

$$\Omega(\theta_1) = \frac{(n - 1)\pi^{(n-1)/2}}{\Gamma\!\left(\frac{n+1}{2}\right)} \int_0^{\theta_1} (\sin\theta)^{n-2}\, d\theta. \tag{21}$$

Fig. 2 - Cap cut out by the cone on the unit sphere.

Here we used the formula for the surface $S_n(r)$ of a sphere of radius $r$
in $n$ dimensions, $S_n(r) = n\pi^{n/2} r^{n-1}/\Gamma(n/2 + 1)$.
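
The relation between the cone angle and the rate can be explored numerically from the cap integral. The sketch below is an illustration with names of our own choosing: it evaluates $\Omega(\theta_1)/\Omega(\pi)$ as a ratio of two integrals of $(\sin\theta)^{n-2}$ using Simpson's rule (the constant in front cancels), converts it to a rate through $e^{-nR} = \Omega(\theta_1)/\Omega(\pi)$, and compares it with the asymptotic expression $\sin^n\theta_1/(\sqrt{2\pi n}\,\sin\theta_1\cos\theta_1)$ quoted in the Summary.

```python
import math


def simpson(f, a, b, steps=2000):
    """Composite Simpson rule on [a, b]; `steps` must be even."""
    h = (b - a) / steps
    total = f(a) + f(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0


def cap_fraction(theta1, n):
    """Omega(theta1) / Omega(pi) for a cone of half-angle theta1 in n dimensions."""
    integrand = lambda t: math.sin(t) ** (n - 2)
    return simpson(integrand, 0.0, theta1) / simpson(integrand, 0.0, math.pi)


def rate_from_angle(theta1, n):
    """Rate R (natural units) defined by exp(-n R) = Omega(theta1) / Omega(pi)."""
    return -math.log(cap_fraction(theta1, n)) / n


def cap_fraction_asymptotic(theta1, n):
    s, c = math.sin(theta1), math.cos(theta1)
    return s ** n / (math.sqrt(2.0 * math.pi * n) * s * c)


n, theta1 = 200, 1.0  # hypothetical block length and cone angle (radians)
print(rate_from_angle(theta1, n), -math.log(math.sin(theta1)))
print(cap_fraction(theta1, n) / cap_fraction_asymptotic(theta1, n))
```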


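To obtain simple inequalities and asymptotic expressions for
$\Omega(\theta_1)$, make the change of variable in the integral
$x = \sin\theta$, $d\theta = (1 - x^2)^{-1/2} dx$. Let $x_1 = \sin\theta_1$
and assume $\theta_1 < \pi/2$, so that $x_1 < 1$. Using the mean value
theorem we obtain

$$(1 - x^2)^{-1/2} = (1 - x_1^2)^{-1/2} + \frac{\alpha}{(1 - \alpha^2)^{3/2}}\,(x - x_1), \tag{22}$$

where $0 \le \alpha \le x_1$. The term $\alpha(1 - \alpha^2)^{-3/2}$ must lie
in the range from 0 to $x_1(1 - x_1^2)^{-3/2}$ since this is a monotone
increasing function. Hence we have the inequalities

$$(1 - x_1^2)^{-1/2} + \frac{x_1 (x - x_1)}{(1 - x_1^2)^{3/2}} \le (1 - x^2)^{-1/2} \le (1 - x_1^2)^{-1/2}, \qquad 0 \le x \le x_1.$$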
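
Note that $x - x_1$ is negative, so the correction term on the left is of
the right sign. If we use these in the integral for $\Omega(\theta_1)$ we
obtain

$$\frac{(n - 1)\pi^{(n-1)/2}}{\Gamma\!\left(\frac{n+1}{2}\right)} \int_0^{x_1} x^{n-2} \left[ (1 - x_1^2)^{-1/2} + \frac{x_1 (x - x_1)}{(1 - x_1^2)^{3/2}} \right] dx \le \Omega(\theta_1) \le \frac{(n - 1)\pi^{(n-1)/2}}{\Gamma\!\left(\frac{n+1}{2}\right)} \int_0^{x_1} \frac{x^{n-2}\, dx}{\sqrt{1 - x_1^2}},$$

$$\frac{\pi^{(n-1)/2} (\sin\theta_1)^{n-1}}{\Gamma\!\left(\frac{n+1}{2}\right)\cos\theta_1} \left( 1 - \frac{1}{n}\tan^2\theta_1 \right) \le \Omega(\theta_1) \le \frac{\pi^{(n-1)/2} (\sin\theta_1)^{n-1}}{\Gamma\!\left(\frac{n+1}{2}\right)\cos\theta_1}.$$

Therefore, as $n \to \infty$, $\Omega(\theta_1)$ is asymptotic to the
expression on the right. The surface of the unit n sphere is
$n\pi^{n/2}/\Gamma(n/2 + 1)$, hence,

$$\frac{\Gamma\!\left(\frac{n}{2} + 1\right)(\sin\theta_1)^{n-1}}{n\,\Gamma\!\left(\frac{n+1}{2}\right)\pi^{1/2}\cos\theta_1} \left( 1 - \frac{1}{n}\tan^2\theta_1 \right) \le e^{-nR} = \frac{\Omega(\theta_1)}{\Omega(\pi)} \le \frac{\Gamma\!\left(\frac{n}{2} + 1\right)(\sin\theta_1)^{n-1}}{n\,\Gamma\!\left(\frac{n+1}{2}\right)\pi^{1/2}\cos\theta_1}.$$
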

Replacing the gamma functions by their asymptotic expressions, we obtain

$$e^{-nR} = \frac{\sin^n\theta_1}{\sqrt{2\pi n}\,\sin\theta_1\cos\theta_1} \left[ 1 + O\!\left(\frac{1}{n}\right) \right]. \tag{28}$$

Thus $e^{-nR} \sim \sin^n\theta_1/(\sqrt{2\pi n}\,\sin\theta_1\cos\theta_1)$
and $e^{-R} \sim \sin\theta_1$. The somewhat sharper expression for
$e^{-nR}$ must be used when attempting asymptotic evaluations of $P_e$,
since $P_e$ is changed by a factor when $\theta_1$ is changed by, for
example, $k/n$. However, when only the reliability $E$ is of interest, the
simpler $R \sim -\log\sin\theta_1$ may be used.


VI. ASYMPTOTIC FORMULAS FOR $Q(\theta)$ AND $Q'(\theta)$


In Fig. 3, $O$ is the origin, $S$ is a signal point and the plane of the
figure is a plane section in the n-dimensional space. The lines $OA$ and
$OB$ represent a (circular) cone of angle $\theta$ about $OS$ (that is, the
intersection of this cone with the plane of the drawing). The lines $OA'$
and $OB'$ correspond to a slightly larger cone of angle $\theta + d\theta$.
We wish to estimate the probability $-dQ(\theta)$ of the signal point $S$
being carried by noise into the region between these cones. From this, we
will further calculate the probability $Q(\theta)$ of $S$ being carried
outside the $\theta$ cone. What is desired in both cases is an asymptotic
estimate, that is, a simple formula whose ratio to the true value approaches
1 as $n$, the number of dimensions, increases.

Fig. 3 - Plane of cone of half-angle $\theta$.

The noise perturbs all coordinates normally and independently with
variance 1. It produces a spherical gaussian distribution in the
n-dimensional space. The probability density of its moving the signal point
a distance $d$ is given by

$$\frac{1}{(2\pi)^{n/2}}\, e^{-d^2/2}\, dV, \tag{29}$$



where $dV$ is the element of volume. In Fig. 4 we wish to first calculate
the probability density for the crosshatched ring-shaped region between
the two cones and between spheres about the origin of radius $r$ and
$r + dr$. The distance of this ring from the signal point is given by the
cosine law as

$$d = (r^2 + A^2 n - 2rA\sqrt{n}\cos\theta)^{1/2}. \tag{30}$$

Fig. 4 - Special value $\theta_0$.

The differential volume of the ring-shaped region is $r\,dr\,d\theta$ times
the surface of a sphere of radius $r\sin\theta$ in $(n - 1)$-dimensional
space; that is

$$r\,dr\,d\theta\; \frac{(n - 1)\pi^{(n-1)/2} (r\sin\theta)^{n-2}}{\Gamma\!\left(\frac{n+1}{2}\right)}. \tag{31}$$

Hence, the differential probability for the ring-shaped region is

$$\frac{1}{(2\pi)^{n/2}} \exp\!\left[ -\frac{r^2 + A^2 n - 2rA\sqrt{n}\cos\theta}{2} \right] \frac{(n - 1)\pi^{(n-1)/2} (r\sin\theta)^{n-2}}{\Gamma\!\left(\frac{n+1}{2}\right)}\; r\,dr\,d\theta. \tag{32}$$

The differential probability $-dQ$ of being carried between the two cones
is the integral of this expression from zero to infinity on $dr$:

$$-dQ = \frac{(n - 1)\, d\theta}{2^{n/2}\sqrt{\pi}\;\Gamma\!\left(\frac{n+1}{2}\right)} \int_0^{\infty} \exp\!\left[ -\frac{r^2 + A^2 n - 2rA\sqrt{n}\cos\theta}{2} \right] (r\sin\theta)^{n-2}\, r\, dr. \tag{33}$$

In the exponent we can think of $A^2 n$ as
$A^2 n(\sin^2\theta + \cos^2\theta)$. The $\cos^2$ part then combines with
the other terms to give a perfect square

$$(r - A\sqrt{n}\cos\theta)^2$$

and the $\sin^2$ term can be taken outside the integral. Thus

$$-dQ = \frac{(n - 1) \exp\!\left[ -\frac{A^2 n \sin^2\theta}{2} \right] (\sin\theta)^{n-2}\, d\theta}{2^{n/2}\sqrt{\pi}\;\Gamma\!\left(\frac{n+1}{2}\right)} \int_0^{\infty} \exp\!\left[ -\frac{(r - A\sqrt{n}\cos\theta)^2}{2} \right] r^{n-1}\, dr. \tag{34}$$



We can now direct our attention to estimating the integral, which we
call $K$. The integral can be expressed exactly as a finite, but
complicated, sum involving normal distribution functions by a process of
continued integration by parts. We are, however, interested in a simple
formula giving the asymptotic behavior of the integral as $n$ becomes
infinite. This problem was essentially solved by David and Kruskal,* who
prove the following asymptotic formula as a lemma:

$$\int_0^{\infty} z^{\nu} \exp\!\left( -\tfrac{1}{2} z^2 + z\sqrt{\nu + 1}\, w \right) dz \sim \sqrt{2\pi} \left( \frac{\tilde z}{e} \right)^{\nu} \exp\!\left( \tfrac{1}{2} \tilde z^2 \right) T, \tag{35}$$

as $\nu \to \infty$, $w$ is fixed,
$T = [1 + \tfrac{1}{4}(\sqrt{w^2 + 4} - w)^2]^{-1/2}$ and
$\tilde z = \tfrac{1}{2}\sqrt{\nu + 1}\, w + \sqrt{\tfrac{1}{4}(\nu + 1) w^2 + \nu}$.

This is proved by showing that the main contribution to the integral 
is essentially in the neighborhood of the point Z where the integral is a 
maximum. Near this point, when » is large, the function behaves about 
as a normal distribution. 

The integral K in (34) that we wish to evaluate is, except for a multi- 
plying factor, of the form appearing in the lemma, with 


z2=f, w= A cos 8, v=n-—l1. 

The integral then becomes


A’n cos’ 6 a z 
K exp (- in ra ) | 2" exp -(; + 2A Yn cos 6 


An cos 0 
~ exp | — 


9 
» have 


4/n A cos 6 + +/1nA?2 cos? 6+ n — 1 
lar Pe A 
Vn | 3A cosé + V -co? 6+ 1 — 

a 1 n 


9 


7 
1A cos 6 / - ‘0OS" - 
SA co + 4 cos? 6 + | 








I l 
rE + O : 
cos? 6+ | “ 


2n /: , 


Letting 


G = 3|A cos 6 + V/ A? cos? 6 + 4], 


2= + /nG)1-—- ie ; 
i Om nG V A? cos? 6 + 4 n? : 


VnG\"" | 2 i 
= Ot. 
e ) E nG ~/ A® cos? 6 + 4 t (+) 
(; 
nG\"" mt I 
¢ ais G V/ A? cos? 6 + 4 


5° x ] | 2 
xp — = x i Gr — os 
— 2 ee E nG v/ A? cos? 6 + 4 . (+) 
= 2G 
~exp| onG — 2/A? cost 0 + 4 


G 
= eX In(1 AG cos 6) — ’ 
exp E v co V/ A? cos? 6 + i 


we have 
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since, on squaring G, we find G = 1 + AG cos 6. Collecting terms: 


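$$K \sim T\sqrt{2\pi}\left(\frac{\sqrt{n}\,G}{e}\right)^{n-1} e^{n/2}\exp\left(-\frac{1}{G\sqrt{A^2\cos^2\theta + 4}} - \frac{G}{\sqrt{A^2\cos^2\theta + 4}} - \frac{n}{2}A^2\cos^2\theta + \frac{n}{2}AG\cos\theta\right)$$

$$= T\sqrt{2\pi}\,n^{(n-1)/2}\,G^{n-1}e^{-n/2}\exp\left(-\frac{n}{2}A^2\cos^2\theta + \frac{n}{2}AG\cos\theta\right),$$

since a little algebra shows that the terms

$$1 - \frac{1}{G\sqrt{A^2\cos^2\theta + 4}} - \frac{G}{\sqrt{A^2\cos^2\theta + 4}}$$

in the exponential cancel to zero. The coefficient of the integral (34), using the asymptotic expression for $\Gamma[(n+1)/2]$, is asymptotic to

$$\frac{(n-1)\,e^{-(\sin^2\theta)(A^2 n)/2}\,(\sin\theta)^{n-2}\,e^{(n+1)/2}}{2^{n/2}\sqrt{\pi}\left(\dfrac{n+1}{2}\right)^{n/2}\sqrt{2\pi}}. \qquad (41)$$

Combining with the above and collecting terms (we find that $T = G/\sqrt{1+G^2}$):

$$-\frac{dQ}{d\theta} \sim \frac{n-1}{\sqrt{\pi n}}\,\frac{1}{\sqrt{1+G^2}\,\sin^2\theta}\left[G\sin\theta\exp\left(-\frac{A^2}{2} + \tfrac12 AG\cos\theta\right)\right]^n. \qquad (42)$$

This is our desired asymptotic expression for the density $dQ/d\theta$.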
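The algebraic facts used in collecting terms are easy to confirm numerically. The following sketch is an illustration only (the names `G` and `identity_residuals` are chosen for this example); it evaluates the residuals of $G^2 = 1 + AG\cos\theta$, of the cancellation of the constant terms in the exponential, and of $T = G/\sqrt{1+G^2}$.

```python
import math

def G(A, theta):
    """G = (A cos(theta) + sqrt(A^2 cos^2(theta) + 4)) / 2."""
    c = A * math.cos(theta)
    return 0.5 * (c + math.sqrt(c * c + 4.0))

def identity_residuals(A, theta):
    """Residuals of three identities; each should vanish up to rounding."""
    c = A * math.cos(theta)
    g = G(A, theta)
    s = math.sqrt(c * c + 4.0)
    r_square = g * g - (1.0 + c * g)              # G^2 = 1 + A G cos(theta)
    r_cancel = 1.0 - 1.0 / (g * s) - g / s        # constant terms in the exponent
    T = (1.0 + 0.25 * (s - c) ** 2) ** -0.5       # T of the lemma with w = A cos(theta)
    r_T = T - g / math.sqrt(1.0 + g * g)          # T = G / sqrt(1 + G^2)
    return r_square, r_cancel, r_T

print(identity_residuals(2.0, 0.7))
```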





As we have arranged it, the coefficient increases essentially as $\sqrt{n}$ and there is another term of the form $e^{-E_L(\theta)n}$, where

$$E_L(\theta) = \frac{A^2}{2} - \tfrac12 AG\cos\theta - \log(G\sin\theta).$$

It can be shown that if we use for $\theta$ the special value $\theta_0 = \cot^{-1}A$ (see Fig. 4) then $E_L(\theta_0) = 0$ and also $E_L'(\theta_0) = 0$. In fact, for this value

$$G(\theta_0) = \tfrac12\left(A\cos\theta_0 + \sqrt{A^2\cos^2\theta_0 + 4}\right) = \tfrac12\left(\cot\theta_0\cos\theta_0 + \sqrt{\cot^2\theta_0\cos^2\theta_0 + 4}\right) = \csc\theta_0.$$

Hence the two terms in the logarithm cancel. Also

$$\frac{A^2}{2} - \tfrac12 AG\cos\theta_0 = \tfrac12\cot^2\theta_0 - \tfrac12\cot\theta_0\csc\theta_0\cos\theta_0 = 0.$$

So $E_L(\theta_0) = 0$. We also have


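$$E_L'(\theta) = \tfrac12 AG\sin\theta - \tfrac12 AG'\cos\theta - \frac{G'}{G} - \cot\theta. \qquad (43)$$

When evaluated, the term $-G'/G$ simplifies, after considerable algebra, to

$$\frac{A\sin\theta}{\sqrt{A^2\cos^2\theta + 4}}.$$

Substituting this and the other terms we obtain

$$E_L'(\theta) = \frac{A^2}{2}\sin\theta\cos\theta + \frac14\,\frac{A^3\cos^2\theta\sin\theta}{\sqrt{A^2\cos^2\theta + 4}} + \frac{A}{4}\,\frac{(A^2\cos^2\theta + 4)}{\sqrt{A^2\cos^2\theta + 4}}\sin\theta + \frac{A\sin\theta}{\sqrt{A^2\cos^2\theta + 4}} - \cot\theta.$$

Adding and collecting terms, this simplifies to

$$E_L'(\theta) = \frac{A}{2}\left(A\cos\theta + \sqrt{A^2\cos^2\theta + 4}\right)\sin\theta - \cot\theta = AG\sin\theta - \cot\theta$$

$$= \cot\theta\left[\frac{A^2}{2}\sin^2\theta + \frac{A}{2}\sin^2\theta\sqrt{A^2 + \frac{4}{\cos^2\theta}} - 1\right]. \qquad (45)$$

Notice that the bracketed expression is a monotone increasing function of $\theta$ ($0 \le \theta \le \pi/2$) ranging from $-1$ at $\theta = 0$ to $\infty$ at $\theta = \pi/2$. Also, as mentioned above, at $\theta_0$, $G = \csc\theta_0$ and $A = \cot\theta_0$, so $E_L'(\theta_0) = 0$. It follows that $E_L'(\theta) < 0$ for $0 < \theta < \theta_0$ and $E_L'(\theta) > 0$ for $\theta_0 < \theta < \pi/2$.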
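These properties of $E_L$ can be spot-checked numerically. The sketch below is illustrative; the function names and the sample value $A = 1.5$ are assumptions for the example. It evaluates $E_L(\theta)$ and the closed form $AG\sin\theta - \cot\theta$ for its derivative.

```python
import math

def G(A, theta):
    c = A * math.cos(theta)
    return 0.5 * (c + math.sqrt(c * c + 4.0))

def E_L(A, theta):
    """E_L(theta) = A^2/2 - (1/2) A G cos(theta) - log(G sin(theta))."""
    g = G(A, theta)
    return 0.5 * A * A - 0.5 * A * g * math.cos(theta) - math.log(g * math.sin(theta))

def dE_L(A, theta):
    """Closed form of the derivative: A G sin(theta) - cot(theta)."""
    return A * G(A, theta) * math.sin(theta) - 1.0 / math.tan(theta)

A = 1.5                        # sample amplitude ratio, chosen for illustration
theta0 = math.atan(1.0 / A)    # cot(theta0) = A
print(E_L(A, theta0), dE_L(A, theta0))
```

Both printed values should be zero up to rounding, and `dE_L` should change sign from negative to positive as $\theta$ passes through $\theta_0$.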
From this, it follows that, in the range from some $\theta_1$ to $\pi/2$ with $\theta_1 > \theta_0$, the minimum $E_L(\theta)$ will occur at the smallest value of $\theta$ in the range, that is, at $\theta_1$. The exponential appearing in our estimate of $Q(\theta)$, namely, $e^{-E_L(\theta)n}$, will have its maximum at $\theta_1$, for such a range. Indeed, for sufficiently large $n$, the maximum of the entire expression (45) must occur at $\theta_1$, since the effect of the $n$ in the exponent will eventually dominate anything due to the coefficient. For, if the coefficient is called $\alpha(\theta)$ with $y(\theta) = \alpha(\theta)e^{-nE_L(\theta)}$, then

$$y'(\theta) = e^{-nE_L(\theta)}\left[-\alpha(\theta)\,nE_L'(\theta) + \alpha'(\theta)\right], \qquad (46)$$

and, since $\alpha(\theta) > 0$, when $n$ is sufficiently large $y'(\theta)$ will be negative and the only maximum will occur at $\theta_1$. In the neighborhood of $\theta_1$ the function goes down exponentially.

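We may now find an asymptotic formula for the integral

$$Q(\theta_1) = \int_{\theta_1}^{\pi/2}\alpha(\theta)\,e^{-nE_L(\theta)}\,d\theta + Q(\pi/2) \qquad (47)$$

by breaking the integral into two parts,

$$Q(\theta_1) = \int_{\theta_1}^{\theta_1 + n^{-2/3}} + \int_{\theta_1 + n^{-2/3}}^{\pi/2} + \; Q(\pi/2). \qquad (48)$$

In the range of the first integral, $(1-\epsilon)\alpha(\theta_1) \le \alpha(\theta) \le \alpha(\theta_1)(1+\epsilon)$, and $\epsilon$ can be made as small as desired by taking $n$ sufficiently large. This is because $\alpha(\theta)$ is continuous and nonvanishing in the range. Also, using a Taylor's series expansion with remainder,

$$e^{-nE_L(\theta)} = \exp\left[-nE_L(\theta_1) - n(\theta - \theta_1)E_L'(\theta_1) - n\frac{(\theta - \theta_1)^2}{2}E_L''(\theta^*)\right],$$

where $\theta^*$ is in the interval $\theta_1$ to $\theta$. As $n$ increases the maximum value of the remainder term is bounded by $(n/2)\,n^{-4/3}E''_{\max}$, and consequently approaches zero. Hence, our first integral is asymptotic to

$$\alpha(\theta_1)\int_{\theta_1}^{\theta_1 + n^{-2/3}}\exp\left[-nE_L(\theta_1) - n(\theta - \theta_1)E_L'(\theta_1)\right]d\theta$$

$$= -\alpha(\theta_1)\exp\left[-nE_L(\theta_1)\right]\left.\frac{\exp\left[-n(\theta - \theta_1)E_L'(\theta_1)\right]}{nE_L'(\theta_1)}\right|_{\theta_1}^{\theta_1 + n^{-2/3}}$$

$$\sim \frac{\alpha(\theta_1)\,e^{-nE_L(\theta_1)}}{nE_L'(\theta_1)},$$

since, at large $n$, the upper limit term becomes small by comparison.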

The second integral from $\theta_1 + n^{-2/3}$ to $\pi/2$ can be dominated by the value of the integrand at $\theta_1 + n^{-2/3}$ multiplied by the range

$$\pi/2 - (\theta_1 + n^{-2/3}),$$

(since the integrand is monotone decreasing for large $n$). The value at $\theta_1 + n^{-2/3}$ is asymptotic, by the argument just given, to

$$\alpha(\theta_1)\exp\left[-nE_L(\theta_1) - n(n^{-2/3})E_L'(\theta_1)\right].$$

This becomes small compared to the first integral [as does $Q(\pi/2) = \Phi(-A\sqrt{n})$ in (47)] and, consequently, on substituting for $\alpha(\theta_1)$ its value and writing $\theta$ for $\theta_1$, we obtain as an asymptotic expression for $Q(\theta)$:

$$Q(\theta) \sim \frac{1}{\sqrt{n\pi}}\,\frac{\left[G\sin\theta\exp\left(-\dfrac{A^2}{2} + \tfrac12 AG\cos\theta\right)\right]^n}{\sqrt{1+G^2}\,\sin\theta\,(AG\sin^2\theta - \cos\theta)} \qquad (51)$$

$$\left(\frac{\pi}{2} \ge \theta > \theta_0 = \cot^{-1}A\right).$$

This expression gives an asymptotic lower bound for $P_{e\,\mathrm{opt}}$, obtained by evaluating $Q(\theta)$ for the $\theta_1$ such that $M\Omega(\theta_1) = \Omega(\pi)$.
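A direct evaluation of the asymptotic expression (51) can be sketched as follows. This is an illustration under the stated range restriction $\theta > \theta_0$; the function names are assumptions for the example and no claim is made about accuracy at small $n$.

```python
import math

def G(A, theta):
    c = A * math.cos(theta)
    return 0.5 * (c + math.sqrt(c * c + 4.0))

def exp_minus_E_L(A, theta):
    """G sin(theta) exp(-A^2/2 + A G cos(theta)/2), i.e. exp(-E_L(theta))."""
    g = G(A, theta)
    return g * math.sin(theta) * math.exp(-0.5 * A * A + 0.5 * A * g * math.cos(theta))

def Q_asymptotic(n, A, theta):
    """Asymptotic expression (51); meaningful only for theta > arccot(A)."""
    if theta <= math.atan(1.0 / A):
        raise ValueError("theta must exceed theta_0 = arccot(A)")
    g = G(A, theta)
    s = math.sin(theta)
    denom = (math.sqrt(n * math.pi) * math.sqrt(1.0 + g * g)
             * s * (A * g * s * s - math.cos(theta)))
    return exp_minus_E_L(A, theta) ** n / denom

A = 1.5                               # sample value, chosen for illustration
theta = math.atan(1.0 / A) + 0.3
print(Q_asymptotic(100, A, theta), Q_asymptotic(200, A, theta))
```

Doubling $n$ multiplies the expression by $e^{-nE_L(\theta)}/\sqrt{2}$, which shows the exponential term dominating the slowly varying coefficient.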
Incidentally, the asymptotic expression (51) can be translated into an asymptotic expression for the noncentral $t$ cumulative distribution by substitution of variables $\theta = \cot^{-1}(t/\sqrt{f})$ and $n - 1 = f$. This may be useful in other applications of the noncentral $t$-distribution.


VII. ASYMPTOTIC EXPRESSIONS FOR THE RANDOM CODE BOUND


We now wish to find similar asymptotic expressions for the upper bound on $P_{e\,\mathrm{opt}}$ of (20) found by the random code method. Substituting the asymptotic expressions for $dQ(\theta)/d\theta$ and for $\Omega(\theta)/\Omega(\pi)$ gives for an asymptotic upper bound the following:

$$Q(\theta_1) + e^{nR}\int_0^{\theta_1}\frac{\Gamma\left(\dfrac{n}{2}+1\right)(\sin\theta)^{n-1}}{n\,\Gamma\left(\dfrac{n+1}{2}\right)\pi^{1/2}\cos\theta}\cdot\frac{n-1}{\sqrt{\pi n}}\,\frac{\left[G\sin\theta\exp\left(-\dfrac{A^2}{2} + \tfrac12 AG\cos\theta\right)\right]^n}{\sqrt{1+G^2}\,\sin^2\theta}\,d\theta. \qquad (52)$$

Thus we need to estimate the integral

$$W = \int_0^{\theta_1}\frac{1}{\cos\theta\,\sin^3\theta\,\sqrt{1+G^2}}\exp\left\{n\left(-\frac{P}{2N} + \frac12\sqrt{\frac{P}{N}}\,G\cos\theta + \log G + 2\log\sin\theta\right)\right\}d\theta. \qquad (53)$$

The situation is very similar to that in estimating $Q(\theta)$. Let the coefficient of $n$ in the exponent be $D$. Note that $D = -E_L(\theta) + \log\sin\theta$. Hence its derivative reduces to

$$\frac{dD}{d\theta} = -AG\sin\theta + 2\cot\theta. \qquad (54)$$

$dD/d\theta = 0$ has a unique root $\theta_c$, $0 < \theta_c \le \pi/2$, for any fixed $A > 0$. This follows from the same argument used in connection with (45), the only difference being a factor of 2 in the right member. Thus, for $\theta < \theta_c$, $dD/d\theta$ is positive and $D$ is an increasing function of $\theta$. Beyond this maximum, $D$ is a decreasing function.
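The root $\theta_c$ has no simple closed form, but it is easily located numerically. The following sketch uses bisection on $dD/d\theta$ from (54), relying on the sign change described above. The function names and tolerance are assumptions for the example.

```python
import math

def G(A, theta):
    c = A * math.cos(theta)
    return 0.5 * (c + math.sqrt(c * c + 4.0))

def D(A, theta):
    """Coefficient of n in the exponent of W, with A^2 = P/N."""
    g = G(A, theta)
    return (-0.5 * A * A + 0.5 * A * g * math.cos(theta)
            + math.log(g) + 2.0 * math.log(math.sin(theta)))

def dD(A, theta):
    """Equation (54): dD/dtheta = -A G sin(theta) + 2 cot(theta)."""
    return -A * G(A, theta) * math.sin(theta) + 2.0 / math.tan(theta)

def theta_c(A, tol=1e-14):
    """Bisection for the root of dD/dtheta on (0, pi/2)."""
    lo, hi = 1e-9, math.pi / 2.0      # dD > 0 near 0 and dD < 0 at pi/2
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if dD(A, mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

A = 1.5                               # sample value, chosen for illustration
tc = theta_c(A)
print(tc, D(A, tc))
```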

We may now divide the problem of estimating the integral $W$ into cases according to the relative size of $\theta_c$ and $\theta_1$.


Case 1: $\theta_1 < \theta_c$.


In this case the maximum of the exponent within the range of integration occurs at $\theta_1$. Consequently, when $n$ is sufficiently large, the maximum of the entire integrand occurs at $\theta_1$. The asymptotic value can be estimated exactly as we estimated $Q(\theta)$ in a similar situation. The integral is divided into two parts, a part from $\theta_1 - n^{-2/3}$ to $\theta_1$ and a second part from 0 to $\theta_1 - n^{-2/3}$. In the first part the integrand behaves asymptotically like:

$$\frac{1}{\cos\theta_1\,\sin^3\theta_1\,\sqrt{1+G^2(\theta_1)}}\exp\left(n\left\{-\frac{P}{2N} + \frac12\sqrt{\frac{P}{N}}\,G(\theta_1)\cos\theta_1 + \log G(\theta_1) + 2\log\sin\theta_1 - (\theta - \theta_1)\left[AG(\theta_1)\sin\theta_1 - 2\cot\theta_1\right]\right\}\right).$$

This integrates asymptotically to

$$\frac{\exp\left(n\left[-\dfrac{P}{2N} + \dfrac12\sqrt{\dfrac{P}{N}}\,G(\theta_1)\cos\theta_1 + \log G(\theta_1) + 2\log\sin\theta_1\right]\right)}{\cos\theta_1\,\sin^3\theta_1\,\sqrt{1+G^2(\theta_1)}\,\left[-AG(\theta_1)\sin\theta_1 + 2\cot\theta_1\right]n}. \qquad (56)$$

The second integral becomes small in comparison to this, being dominated by an exponential with a larger negative exponent multiplied by the range $\theta_1 - n^{-2/3}$. With the coefficient


a [Gt 
snes 


r( ; ) V 2 


and using the fact that


our dominant term approaches


$$\frac{\left[G\sin\theta_1\exp\left(-\dfrac{A^2}{2} + \tfrac12 AG\cos\theta_1\right)\right]^n}{\sqrt{n\pi}\,\sqrt{1+G^2}\,\sin\theta_1\,(2\cos\theta_1 - AG\sin^2\theta_1)}.$$

Combining this with the previously obtained asymptotic expression (51) for $Q(\theta_1)$ we obtain the following asymptotic expression for the upper bound on $P_{e\,\mathrm{opt}}$ for $\theta_1 < \theta_c$:

$$\left(1 - \frac{\cos\theta_1 - AG\sin^2\theta_1}{2\cos\theta_1 - AG\sin^2\theta_1}\right)\frac{\left[G\sin\theta_1\exp\left(-\dfrac{A^2}{2} + \tfrac12 AG\cos\theta_1\right)\right]^n}{\sqrt{n\pi}\,\sqrt{1+G^2}\,\sin\theta_1\,(AG\sin^2\theta_1 - \cos\theta_1)}. \qquad (58)$$

Since our lower bound was asymptotic to the same expression without the parenthesis in front, the two asymptotes differ only by the factor

$$\left(1 - \frac{\cos\theta_1 - AG\sin^2\theta_1}{2\cos\theta_1 - AG\sin^2\theta_1}\right)$$

independent of $n$. This factor increases as $\theta_1$ increases from the value $\theta_0$, corresponding to channel capacity, to the critical value $\theta_c$, for which the denominator vanishes. Over this range the factor increases from 1 to $\infty$. In other words, for large $n$, $P_{e\,\mathrm{opt}}$ is determined to within a factor. Furthermore, the percentage uncertainty due to this factor is smaller at rates closer to channel capacity, approaching zero as the rate approaches capacity. It is quite interesting that these seemingly weak bounds can work out to give such sharp information for certain ranges of the variables.


Case 2: $\theta_1 > \theta_c$.


For $\theta_1$ in this range the previous argument does not hold, since the maximum of the exponent is not at the end of the range of integration but rather interior to it. This unique maximum occurs at $\theta_c$, the root of $2\cos\theta_c - AG\sin^2\theta_c = 0$. We divide the range of integration into three parts: 0 to $\theta_c - n^{-2/5}$, $\theta_c - n^{-2/5}$ to $\theta_c + n^{-2/5}$, and $\theta_c + n^{-2/5}$ to $\theta_1$. Proceeding by very similar means, in the neighborhood of $\theta_c$ the exponential behaves as

$$\exp\left(-n\left\{E_L(\theta_c) + \frac{(\theta - \theta_c)^2}{2}E_L''(\theta_c) + O\left[(\theta - \theta_c)^3\right]\right\}\right).$$

The coefficient of the exponential approaches constancy in the small interval surrounding $\theta_c$. Thus the integral (53) for this part is asymptotic to

$$\frac{1}{\cos\theta_c\,\sin^3\theta_c\,\sqrt{1+G^2}}\int_{-\infty}^{\infty}\exp\left\{-n\left[E_L(\theta_c) + \frac{(\theta - \theta_c)^2}{2}E_L''(\theta_c)\right]\right\}d\theta$$

$$\sim \frac{1}{\cos\theta_c\,\sin^3\theta_c\,\sqrt{1+G^2}}\exp\left[-nE_L(\theta_c)\right]\frac{\sqrt{2\pi}}{\sqrt{nE_L''(\theta_c)}}.$$

The other two integrals become small by comparison when $n$ is large, by essentially the same arguments as before. They may be dominated by the value of the integrand at the end of the range near $\theta_c$ multiplied by the range of integration. Altogether, then, the integral (52) is asymptotic to

$$\frac{1}{\sqrt{\pi n}\,\cos\theta_c\,\sin^3\theta_c\,\sqrt{1+G^2}\,\sqrt{E_L''(\theta_c)}}\,e^{-n[E_L(\theta_c) - R]}. \qquad (60)$$

The other term in (52), namely, $Q(\theta_1)$, is asymptotically small compared to this, under the present case $\theta_1 > \theta_c$, since the coefficient of $n$ in the exponent for $Q(\theta)$ in (51) will be smaller. Thus, all told, the random code bound is asymptotic to

$$\frac{1}{\cos\theta_c\,\sin^3\theta_c\,\sqrt{n\pi E_L''(\theta_c)\left[1+G(\theta_c)^2\right]}}\,e^{-n[E_L(\theta_c) - R]} \qquad (61)$$

for $\theta_1 > \theta_c$ or for rates $R < R_c$, the rate corresponding to $\theta_c$.

Incidentally, the rate $R_c$ is very closely one-half bit less than channel capacity when $A \ge 4$, and approaches this exactly as $A \to \infty$. For lower values of $A$ the difference $C - R_c$ becomes smaller but the ratio $C/R_c \to 4$ as $A \to 0$.
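The remark about $R_c$ can be illustrated numerically. The sketch below assumes the large-$n$ relation $R = -\log\sin\theta$ between rate and cone angle (so that $C = -\log\sin\theta_0 = \tfrac12\log(1+A^2)$ and $R_c = -\log\sin\theta_c$); that relation and all function names are assumptions of this example, with rates in natural units.

```python
import math

def G(A, theta):
    c = A * math.cos(theta)
    return 0.5 * (c + math.sqrt(c * c + 4.0))

def theta_c(A, tol=1e-14):
    """Root of 2 cot(theta) - A G sin(theta) = 0 on (0, pi/2), by bisection."""
    lo, hi = 1e-9, math.pi / 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if 2.0 / math.tan(mid) - A * G(A, mid) * math.sin(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def capacity(A):
    """C = (1/2) log(1 + A^2), in nats."""
    return 0.5 * math.log(1.0 + A * A)

def critical_rate(A):
    """R_c = -log sin(theta_c), in nats (assumed large-n relation)."""
    return -math.log(math.sin(theta_c(A)))

for A in (0.01, 4.0, 100.0):
    gap_bits = (capacity(A) - critical_rate(A)) / math.log(2.0)
    print(A, gap_bits, capacity(A) / critical_rate(A))
```

For large $A$ the gap printed in the second column approaches one-half bit, and for small $A$ the ratio in the third column approaches 4.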


VIII. THE FIRM UPPER BOUND ON $P_{e\,\mathrm{opt}}$


In this section we will find an upper bound, valid for all $n$, on the probability of error by manipulation of the upper bound (20). We first find an upper bound on $Q'(\theta)$. In Ref. 6 the integral (35) is transformed into $\tilde z^{\nu}\exp\left(-\tfrac12\tilde z^2 + \tilde z\sqrt{\nu+1}\,w\right)$ times the following integral (in their notation):

$$U = \int_{-\tilde z}^{\infty}\exp\left\{-\tfrac12 y^2 + \nu\left[\ln\left(1 + \frac{y}{\tilde z}\right) - \frac{y}{\tilde z}\right]\right\}dy.$$





THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1959 
It is pointed out that the integrand here can be dominated by $e^{-y^2/2}$. This occurs in the paragraph in Ref. 6 containing Equation 2.6. Therefore, this integral can be dominated by $\sqrt{2\pi}$, and our integral in (34) involved in $dQ/d\theta$ is dominated as follows:


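$$\int_0^\infty\exp\left[-\frac{(r - A\sqrt{n}\cos\theta)^2}{2}\right]r^{n-1}\,dr \le \sqrt{2\pi}\left(\frac{\tilde z}{e}\right)^{n-1}\exp\left(\frac{\tilde z^2}{2} - \frac{A^2 n}{2}\cos^2\theta\right).$$

We have

$$\tilde z = \tfrac12\sqrt{n}\left(A\cos\theta + \sqrt{A^2\cos^2\theta + 4 - 4/n}\right) \le \sqrt{n}\,G.$$

Replacing $\tilde z$ by this larger quantity gives

$$\left(\frac{\sqrt{n}\,G}{e}\right)^{n-1}\exp\left(\frac{nG^2}{2} - \frac{A^2 n}{2}\cos^2\theta\right)\sqrt{2\pi}.$$

We have, then,

$$-\frac{dQ}{d\theta} \le \frac{(n-1)\exp\left(-\dfrac{A^2 n}{2}\sin^2\theta\right)(\sin\theta)^{n-2}}{2^{n/2}\sqrt{\pi}\;\Gamma\left(\dfrac{n+1}{2}\right)}\left(\frac{\sqrt{n}\,G}{e}\right)^{n-1}\exp\left(\frac{nG^2}{2} - \frac{A^2 n}{2}\cos^2\theta\right)\sqrt{2\pi}.$$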


Replacing the gamma function by its Stirling expression

$$\left(\frac{n+1}{2}\right)^{n/2}\exp\left(-\frac{n+1}{2}\right)\sqrt{2\pi}$$

(which is always too small), and replacing $[1 + (1/n)]^{n/2}$ by $\sqrt{2}$ (which is also too small) again increases the right member. After simplification, we get

$$-\frac{dQ}{d\theta} \le \frac{(n-1)(G\sin\theta)^n\exp\left[\dfrac{n}{2}(-A^2 + 1 + AG\cos\theta)\right]}{\sqrt{n}\,G\sin^2\theta\,\sqrt{2\pi}\,\exp\left(\dfrac{n-3}{2}\right)} = \frac{(n-1)\,e^{3/2}\,e^{-E_L(\theta)n}}{\sqrt{2\pi n}\,G\sin^2\theta}. \qquad (63)$$

Notice that this differs from the asymptotic expression (42) only by a factor

$$\frac{e^{3/2}\sqrt{1+G^2}}{\sqrt{2}\,G} \le e^{3/2}$$

(since $G \ge 1$). A firm upper bound can now be placed on $Q(\theta)$:

$$Q(\theta_1) = \int_{\theta_1}^{\pi/2}\left(-\frac{dQ}{d\theta}\right)d\theta + Q\left(\frac{\pi}{2}\right).$$


We use the upper bound above for $dQ/d\theta$ in the integral. The coefficient of $-n$ in the exponent of $e$,

$$E_L(\theta) = \tfrac12(A^2 - AG\cos\theta) - \log G\sin\theta,$$

is positive and monotone increasing with $\theta$ for $\theta > \theta_0$, as we have seen previously. Its derivative is

$$E_L'(\theta) = AG\sin\theta - \cot\theta.$$

As a function of $\theta$ this curve is as shown in Fig. 5, either rising monotonically from $-\infty$ at $\theta = 0$ to $A$ at $\theta = \pi/2$, or with a single maximum. In any case, the curve is concave downward. To show this analytically, take the second derivative of $E_L'$. This consists of a sum of negative terms.











Fig. 5 — $E_L'(\theta)$ as a function of $\theta$.

Returning to our upper bound on $Q$, the coefficient in (63) does not exceed

$$\frac{\sqrt{n}\,e^{3/2}}{\sqrt{2\pi}\,\sin^2\theta_1},$$

replacing $\sin\theta$ and $G$ by $\sin\theta_1$ and 1, their minimum values in the range. We now wish to replace $e^{-nE_L(\theta)}$ by

$$\exp -n\left[E_L(\theta_1) + (\theta - \theta_1)h\right].$$

If $h$ is chosen equal to the minimum $E_L'(\theta)$, this replacement will increase the integral and therefore give an upper bound. From the behavior of $E_L'(\theta)$ this minimum occurs at either $\theta_1$ or $\pi/2$. Thus, we may take $h = \min[A, AG(\theta_1)\sin\theta_1 - \cot\theta_1]$. With this replacement the integral becomes a simple exponential and can be immediately integrated.

The term $Q(\pi/2)$ is, of course,

$$\Phi(-A\sqrt{n}) \le \frac{1}{\sqrt{2\pi n}\,A}\,e^{-A^2 n/2}.$$
V/2xn A 

If we continue the integral out to infinity instead of stopping at $\pi/2$, the extra part added will more than cover $Q(\pi/2)$. In fact, $E_L(\pi/2) = A^2/2$, so the extra contribution is at least

$$\frac{\sqrt{n}\,e^{3/2}\,e^{-A^2 n/2}}{An\sin^2\theta_1\,\sqrt{2\pi}}$$

if we integrate

$$\frac{\sqrt{n}\,e^{3/2}}{\sin^2\theta_1\,\sqrt{2\pi}}\,e^{-A^2 n/2 - n(\theta - \pi/2)A}$$

to $\infty$ instead of stopping at $\pi/2$. Since $e^{3/2}/\sin^2\theta_1 \ge 1$, we may omit the $Q(\pi/2)$ term in place of the extra part of the integral.

Consequently, we can bound $Q(\theta_1)$ as follows:

$$Q(\theta_1) \le \frac{e^{3/2}\exp\left\{(n/2)\left[AG(\theta_1)\cos\theta_1 - A^2 + 2\log G\sin\theta_1\right]\right\}}{\sqrt{2\pi n}\,\sin^2\theta_1\,\min\left(A,\,AG(\theta_1)\sin\theta_1 - \cot\theta_1\right)}. \qquad (64)$$


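In order to overbound $P_{e\,\mathrm{opt}}$ by (3) it is now necessary to overbound the term

$$\int_0^{\theta_1}\frac{\Omega(\theta)}{\Omega(\theta_1)}\,dQ(\theta).$$

This can be done by a process very similar to that just carried out for $\int dQ(\theta)$. First, we overbound $\Omega(\theta)/\Omega(\theta_1)$ using (21). We have

$$\frac{\Omega(\theta)}{\Omega(\theta_1)} = \frac{\displaystyle\int_0^{\theta}(\sin x)^{n-2}\,dx}{\displaystyle\int_0^{\theta_1}(\sin x)^{n-2}\,dx}$$

$$= \frac{\displaystyle\int_0^{\theta}(\sin x)^{n-2}\,dx}{\displaystyle\int_0^{\theta}(\sin x)^{n-2}\,dx + \int_{\theta}^{\theta_1}(\sin x)^{n-2}\,dx}$$

$$\le \frac{\displaystyle\int_0^{\theta}(\sin x)^{n-2}\cos x\,dx}{\displaystyle\int_0^{\theta}(\sin x)^{n-2}\cos x\,dx + \cos\theta\int_{\theta}^{\theta_1}(\sin x)^{n-2}\,dx}$$

$$\le \frac{\displaystyle\int_0^{\theta}(\sin x)^{n-2}\cos x\,dx}{\displaystyle\int_0^{\theta}(\sin x)^{n-2}\cos x\,dx + \int_{\theta}^{\theta_1}(\sin x)^{n-2}\cos x\,dx}$$

and, finally,

$$\frac{\Omega(\theta)}{\Omega(\theta_1)} \le \frac{(\sin\theta)^{n-1}}{(\sin\theta_1)^{n-1}}. \qquad (65)$$

Here the third line follows since the first integral in the denominator is reduced by the same factor as the numerator and the second integral is reduced more, since $\cos\theta$ is decreasing. In the next line, the denominator is reduced still more by taking the cosine inside.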
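Inequality (65) can be checked numerically for particular values. The sketch below is illustrative (function names and sample values are assumptions); it evaluates both integrals of $(\sin x)^{n-2}$ by a composite Simpson rule and compares the ratio with $(\sin\theta/\sin\theta_1)^{n-1}$ for $\theta \le \theta_1 \le \pi/2$.

```python
import math

def sin_power_integral(theta, p, steps=2000):
    """Simpson approximation of the integral of (sin x)^p over [0, theta]."""
    h = theta / steps
    total = 0.0
    for k in range(steps + 1):
        weight = 1 if k in (0, steps) else (4 if k % 2 else 2)
        total += weight * math.sin(k * h) ** p
    return total * h / 3.0

def cap_ratio(theta, theta1, n):
    """Omega(theta) / Omega(theta1) as a ratio of the two integrals."""
    return sin_power_integral(theta, n - 2) / sin_power_integral(theta1, n - 2)

def cap_ratio_bound(theta, theta1, n):
    """Right member of (65)."""
    return (math.sin(theta) / math.sin(theta1)) ** (n - 1)

for th in (0.2, 0.5, 0.9):
    print(th, cap_ratio(th, 1.0, 10), cap_ratio_bound(th, 1.0, 10))
```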

Using this inequality and also the upper bound (63) on $dQ/d\theta$, we have

$$-\int_0^{\theta_1}\frac{\Omega(\theta)}{\Omega(\theta_1)}\,dQ(\theta) \le \int_0^{\theta_1}\frac{(\sin\theta)^{n-1}}{(\sin\theta_1)^{n-1}}\cdot\frac{(n-1)\,e^{3/2}\,(G\sin\theta)^n\,e^{(n/2)(-A^2 + AG\cos\theta)}}{\sqrt{2\pi n}\,G\sin^2\theta}\,d\theta$$

$$= \frac{(n-1)\,e^{3/2}}{\sqrt{2\pi n}\,(\sin\theta_1)^{n-1}}\int_0^{\theta_1}G^{n-1}(\sin\theta)^{2n-3}\,e^{(n/2)(-A^2 + AG\cos\theta)}\,d\theta. \qquad (66)$$

Near the point $\theta_1$ the integrand here behaves like an exponential when $n$ is large (provided $\theta_1 < \theta_c$), and it should be possible to find a firm upper bound of the form

$$\frac{k}{\sqrt{n}}\,e^{-E_L(\theta_1)n},$$

where $k$ would not depend on $n$. This, however, leads to considerable complexity and we have settled for a cruder formulation as follows: The integrand may be bounded by its maximum values. If $\theta_1 < \theta_c$, the maximum of the integrand will occur at $\theta_1$, at least when $n$ is large enough. In this case, the integral will certainly be bounded by

$$\theta_1\,G^{n-1}(\theta_1)\,(\sin\theta_1)^{2n-3}\,e^{(n/2)(-A^2 + AG(\theta_1)\cos\theta_1)}.$$

The entire expression for $P_{e\,\mathrm{opt}}$ may then be bounded by [adding in the bound (64) on $Q(\theta_1)$]

$$P_{e\,\mathrm{opt}} \le \frac{\theta_1\sqrt{n}\,e^{3/2}\,e^{-E_L(\theta_1)n}}{\sqrt{2\pi}\,\sin^2\theta_1}\left(1 + \frac{1}{n\theta_1\min\left[A,\,AG(\theta_1)\sin\theta_1 - \cot\theta_1\right]}\right). \qquad (67)$$

It must be remembered that (67) is valid only for $\theta_1 < \theta_c$ and if $n$ is large enough to make the maximum of the integrand above occur at $\theta_1$. For $\theta_1 > \theta_c$, bounds could also be constructed based on the maximum value of the integrand.


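IX. A FIRM LOWER BOUND ON $P_{e\,\mathrm{opt}}$


In this section we wish to find a lower bound on $P_{e\,\mathrm{opt}}$ that is valid for all $n$. To do this we first find a lower bound on $Q'(\theta)$ and from this find a lower bound on $Q(\theta)$. The procedure is quite similar to that involved in finding the firm upper bound.

In Ref. 6, the integral (35) above was reduced to the evaluation of the following integral (Equation 2.5 of Ref. 6):

$$\int_{-\tilde z}^{\infty}\left(1 + \frac{y}{\tilde z}\right)^{\nu}\exp\left(-\tfrac12 y^2 - \frac{y\nu}{\tilde z}\right)dy \ge \int_0^{\infty}\exp\left\{-\tfrac12 y^2 + \nu\left[\ln\left(1 + \frac{y}{\tilde z}\right) - \frac{y}{\tilde z}\right]\right\}dy$$

$$\ge \int_0^{\infty}\exp\left(-\tfrac12 y^2 - \frac{\nu y^2}{2\tilde z^2}\right)dy \ge \int_0^{\infty}e^{-y^2}\,dy = \frac{\sqrt{\pi}}{2}.$$

Here we used the inequality

$$\ln\left(1 + \frac{y}{\tilde z}\right) \ge \frac{y}{\tilde z} - \frac{y^2}{2\tilde z^2} \quad\text{for } y > 0,$$

and also the fact that $\nu/\tilde z^2 \le 1$. This latter follows from Equation 2.3 of Ref. 6 on dividing through by $\tilde z^2$.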
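Using this lower bound, we obtain from (34)

$$-\frac{dQ}{d\theta} \ge \frac{(n-1)\,\sin^{n-2}\theta\,\exp\left(-\dfrac{A^2 n}{2}\right)}{2^{n/2}\sqrt{\pi}\;\Gamma\left(\dfrac{n+1}{2}\right)}\cdot\frac{\sqrt{\pi}}{2}\left(\frac{\tilde z}{e}\right)^{n-1}\exp\left(\frac{\tilde z^2}{2}\right). \qquad (68)$$

Now $\tilde z \ge \sqrt{n-1}\,G$ and

$$\Gamma\left(\frac{n+1}{2}\right) < \left(\frac{n+1}{2}\right)^{n/2}e^{-(n+1)/2}\sqrt{2\pi}\,\exp\left[\frac{1}{6(n+1)}\right]$$

and, using the fact that

$$\left(\frac{n-1}{n+1}\right)^{n/2} \ge \frac13 \qquad (n \ge 2),$$

we obtain

$$-\frac{dQ}{d\theta} \ge \frac{1}{6\sqrt{2\pi}}\cdot\frac{\sqrt{n-1}\,e^{3/2}\,e^{-nE_L(\theta)}}{G\,\exp\left[\dfrac{G^2}{2} + \dfrac{1}{6(n+1)}\right]\sin^2\theta}.$$

This is our lower bound on $dQ/d\theta$.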
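The elementary inequalities invoked in the firm bounds can be confirmed over a range of $n$. The sketch below is an illustration; the function name and the range $n \le 200$ are assumptions for the example. It checks the two Stirling bounds on $\Gamma[(n+1)/2]$ through `math.lgamma`, together with $[1 + 1/n]^{n/2} \ge \sqrt{2}$ and $[(n-1)/(n+1)]^{n/2} \ge 1/3$.

```python
import math

def firm_bound_inequalities_hold(n_max=200):
    """Check the Stirling bounds and the two power inequalities for 2 <= n <= n_max."""
    for n in range(2, n_max + 1):
        x = 0.5 * (n + 1)
        # log of ((n+1)/2)^(n/2) * exp(-(n+1)/2) * sqrt(2*pi)
        stirling_log = (x - 0.5) * math.log(x) - x + 0.5 * math.log(2.0 * math.pi)
        lg = math.lgamma(x)
        if not (stirling_log <= lg <= stirling_log + 1.0 / (6.0 * (n + 1))):
            return False
        if (1.0 + 1.0 / n) ** (0.5 * n) < math.sqrt(2.0):
            return False
        if ((n - 1.0) / (n + 1.0)) ** (0.5 * n) < 1.0 / 3.0 - 1e-15:
            return False
    return True

print(firm_bound_inequalities_hold())
```

The small tolerance in the last comparison allows for the case $n = 2$, where the inequality holds with equality.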

To obtain a lower bound on $Q(\theta)$ we may use the same device as before; here, however, replacing the coefficient by its minimum value in the range and the exponent by $-nE_L(\theta_1) - n(\theta - \theta_1)E'_{L\,\max}$:

$$E_L' = AG\sin\theta - \cot\theta \le AG \le A(A+1) \le (A+1)^2.$$

Similarly, in the coefficient, $G$ can be dominated by $A + 1$ and $\sin^2\theta$ by 1. Thus,

$$Q(\theta_1) \ge \int_{\theta_1}^{\pi/2}\frac{\sqrt{n-1}\,e^{3/2}\,e^{-nE_L(\theta_1) - n(\theta - \theta_1)(A+1)^2}}{6\sqrt{2\pi}\,(A+1)\exp\left[\dfrac{(A+1)^2}{2} + \dfrac{1}{6(n+1)}\right]}\,d\theta + Q\left(\frac{\pi}{2}\right). \qquad (70)$$

Integrating and observing that the term due to the $\pi/2$ limit can be absorbed into the $Q(\pi/2)$ (erf $A$) term, we arrive at the lower bound:

$$Q(\theta_1) \ge \frac{\sqrt{n-1}\,e^{3/2}\,e^{-nE_L(\theta_1)}}{6\sqrt{2\pi}\,n\,(A+1)^3\exp\left[\dfrac{(A+1)^2}{2} + \dfrac{1}{6(n+1)}\right]}. \qquad (71)$$


X. BEHAVIOR NEAR CHANNEL CAPACITY


As we have seen, near channel capacity the upper and lower asymptotic bounds are substantially the same. If in the asymptotic lower bound (42) we form a Taylor expansion for $\theta$ near $\theta_0$, retaining terms up to $(\theta - \theta_0)^2$, we will obtain an expression applying to the neighborhood of channel capacity. Another approach is to return to the original noncentral $t$-distribution and use its normal approximation which will be good near the mean (see Ref. 5). Either approach gives, in this neighborhood, the approximations [since $E(\theta_0) = E'(\theta_0) = 0$]:

$$-\frac{dQ}{d\theta} \doteq \frac{\sqrt{n}\,(1 + A^2)}{\sqrt{\pi}\,\sqrt{2 + A^2}}\exp\left[-n\,\frac{(A^2+1)^2}{A^2+2}\,(\theta - \theta_0)^2\right],$$

$$Q(\theta) \doteq \Phi\left[(\theta_0 - \theta)\,\frac{A^2+1}{\sqrt{A^2+2}}\,\sqrt{2n}\right],$$

or, since near channel capacity, using $e^{-R} = \sin\theta$,

$$\theta - \theta_0 \doteq A^{-1}(C - R),$$

$$P_{e\,\mathrm{opt}}(n, R, P) \doteq \Phi\left[\frac{A^2+1}{A\sqrt{A^2+2}}\,\sqrt{2n}\,(R - C)\right] = \Phi\left[\frac{P+N}{\sqrt{P(P+2N)}}\,\sqrt{2n}\,(R - C)\right]. \qquad (73)$$

The reliability curve is approximated near $C$ by

$$E(R) \doteq \frac{(P+N)^2}{P(P+2N)}\,(C - R)^2. \qquad (74)$$





It is interesting that Rice” makes estimates of the behavior of what 
amounts to a lower bound on the exponent $E$ near channel capacity. His exponent, translated into our notation, is

$$E^*(R) \doteq \frac{P+N}{2P}\,(C - R)^2,$$


a poorer value than (74); that is, it will take a larger block length to achieve the same probability of error. This difference is evidently due to the slight difference in the manner of construction of the random codes. Rice's codes are obtained by placing points according to an $n$-dimensional gaussian distribution, each coordinate having variance $P$. In our codes the points are placed at random on a sphere of precisely fixed radius $\sqrt{nP}$. These are very close to the same thing when $n$ is large, since in Rice's situation the points will, with probability approaching 1, lie between the spheres of radii $\sqrt{nP}(1 - \epsilon)$ and $\sqrt{nP}(1 + \epsilon)$ (any $\epsilon > 0$). However, we are dealing with very small probability events in any case when we are estimating probability of error, and the points within the sphere are sufficiently important to affect the exponent $E$. In other words, the Rice type of code is sufficient to give codes that will have a probability of error approaching zero at rates arbitrarily near channel capacity. However, they will not do so at as rapid a rate (even in the exponent) as can be achieved. To achieve the best possible $E$ it is evidently necessary to avoid having too many of the code points interior to the $\sqrt{nP}$ sphere.

At rates $R$ greater than channel capacity we have $\theta_1 < \theta_0$. Since the $\theta$ distribution approaches normality with mean at $\theta_0$ and variance $(A^2+2)/[2n(A^2+1)^2]$, we will have $Q(\theta_1)$ approaching 1 with increasing $n$ for any fixed rate greater than $C$. Indeed, even if the rate $R$ varies but remains always greater than $C$ (perhaps approaching it from above with increasing $n$), we will still have $P_{e\,\mathrm{opt}} > \tfrac12 - \epsilon$ for any $\epsilon > 0$ and sufficiently large $n$.
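The near-capacity approximations of this section can be sketched numerically. The code below is illustrative only (function names and sample values are assumptions): it checks by finite differences that the curvature of $E_L$ at $\theta_0$ is $2(A^2+1)^2/(A^2+2)$, which is the source of the exponent in the normal approximation, and compares the exponent (74) with the exponent attributed to Rice above.

```python
import math

def G(A, theta):
    c = A * math.cos(theta)
    return 0.5 * (c + math.sqrt(c * c + 4.0))

def E_L(A, theta):
    g = G(A, theta)
    return 0.5 * A * A - 0.5 * A * g * math.cos(theta) - math.log(g * math.sin(theta))

def curvature_at_capacity(A, h=1e-4):
    """Second central difference of E_L at theta_0 = arccot(A)."""
    t0 = math.atan(1.0 / A)
    return (E_L(A, t0 + h) - 2.0 * E_L(A, t0) + E_L(A, t0 - h)) / (h * h)

def exponent_near_capacity(P, N, R):
    """Approximation (74), rates in nats."""
    C = 0.5 * math.log(1.0 + P / N)
    return (P + N) ** 2 / (P * (P + 2.0 * N)) * (C - R) ** 2

def rice_exponent_near_capacity(P, N, R):
    """Exponent for gaussian-distributed code points, as quoted in the text."""
    C = 0.5 * math.log(1.0 + P / N)
    return (P + N) / (2.0 * P) * (C - R) ** 2

print(curvature_at_capacity(2.0), 2.0 * 25.0 / 6.0)
print(exponent_near_capacity(1.0, 1.0, 0.3), rice_exponent_near_capacity(1.0, 1.0, 0.3))
```

The ratio of the two exponents is $2(P+N)/(P+2N)$, which lies between 1 and 2, so the fixed-radius construction never gives the smaller exponent in this approximation.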


XI. UPPER BOUND ON $P_{e\,\mathrm{opt}}$ BY METHOD OF EXHAUSTION


For low rates of transmission, where the upper and lower bounds di- 
verge widely, we may obtain better estimates by other methods. For 
very low rates of transmission, the main contribution to the probability 
of error can be shown to be due to the code points that are nearest to- 
gether and thus often confused with each other, rather than to the gen- 
eral average structure of the code. The important thing, at low rates, is 
to maximize the minimum distance between neighbors. Both the upper 
and lower bounds which we will derive for low rates are based on these 
considerations. 

We will first show that, for $D \le \sqrt{2nP}$, it is possible to find at least

$$M_D = \left(\sin 2\sin^{-1}\frac{D}{2\sqrt{nP}}\right)^{-(n-1)}$$

points on the surface of an $n$ sphere of radius $\sqrt{nP}$ such that no pair of them is separated by a distance less than $D$. (If $M_D$ is not an integer, take the next larger integer.) The method used will be similar to one used by E. N. Gilbert for the binary symmetric channel.

Select any point on the sphere’s surface for the first point. Delete from 
the surface all points within D of the selected point. In Fig. 6, x is the 
selected point and the area to be deleted is that cut out by the cone. 
This area is certainly less (if $D \le \sqrt{2nP}$) than the area of the hemisphere of radius $H$ shown and, even more so, less than the area of the sphere of radius $H$. If this deletion does not exhaust the original sphere, select any point from those remaining and delete the points within $D$ of this new point. This again will not take away more area than that of a sphere of radius $H$. Continue in this manner until no points remain. Note that each point chosen is at least $D$ from each preceding point. Hence all interpoint distances are at least $D$. Furthermore, this can be continued at least as many times as the ratio of the surface of a sphere of radius $\sqrt{nP}$ to that of a sphere of radius $H$, since each deletion takes away not more than this much surface area. This ratio is clearly

$$\left(\sqrt{nP}/H\right)^{n-1}.$$
By simple geometry in Fig. 6, we see that $H$ and $D$ are related as follows:

$$\sin\theta = \frac{H}{\sqrt{nP}},$$

$$\sin\frac{\theta}{2} = \frac{D}{2\sqrt{nP}}.$$

Hence

$$\frac{H}{\sqrt{nP}} = \sin 2\sin^{-1}\frac{D}{2\sqrt{nP}}.$$

Substituting, we can place at least

$$M_D = \left(\sin 2\sin^{-1}\frac{D}{2\sqrt{nP}}\right)^{-(n-1)}$$

points at distances at least $D$ from each other, for any $D \le \sqrt{2nP}$. If we have $M_D$ points with minimum distance at least $D$, then the probability of error with optimal decoding will be less than or equal to

$$M_D\,\Phi\left[-D/(2\sqrt{N})\right].$$

To show this we may add up pessimistically the probabilities of each point being received as each other point. Thus the probability of point 1 being moved closer to point 2 than to the original point 1 is not greater than $\Phi[-D/(2\sqrt{N})]$, that is, the probability of the point being moved in a certain direction at least $D/2$ (half the minimum separation). The contribution to errors due to this cause cannot, therefore, exceed $(1/M_D)\Phi[-D/(2\sqrt{N})]$ (the $1/M_D$ factor being the probability of message 1 being transmitted). A similar argument occurs for each (ordered) pair of points, a total of $M_D(M_D - 1)$ contributions of this kind. Consequently, the probability of error cannot exceed $(M_D - 1)\Phi[-D/(2\sqrt{N})]$ or, more simply, $M_D\Phi[-D/(2\sqrt{N})]$.


Fig. 6 — Geometry of sphere of radius $\sqrt{nP}$ (showing the hemisphere of radius $H$).

If we set

$$e^{nR} = M_D = \left(\sin 2\sin^{-1}\frac{D}{2\sqrt{nP}}\right)^{-(n-1)}$$

then the rate $R$ (in natural units) is

$$R = -\left(1 - \frac1n\right)\log\left(\sin 2\sin^{-1}\frac{D}{2\sqrt{nP}}\right)$$

with

$$P_e \le e^{nR}\,\Phi\left(-\frac{D}{2\sqrt{N}}\right) \le e^{nR}\,\frac{\sqrt{2N}}{D\sqrt{\pi}}\,e^{-(D^2/8N)},$$

using the well-known upper bound $\Phi(-x) \le (1/x\sqrt{2\pi})\,e^{-x^2/2}$. These are parametric equations in terms of $D$. It is more convenient to let

$$D = \lambda\sqrt{2nP}.$$

We then have

$$R = -\left(1 - \frac1n\right)\log\left(\sin 2\sin^{-1}\frac{\lambda}{\sqrt{2}}\right),$$

$$P_e \le \frac{1}{\lambda\sqrt{\pi nP/N}}\,e^{n[R - (\lambda^2 P)/(4N)]}.$$

The asymptotic reliability, that is, the coefficient of $-n$ in the exponent of $P_e$, is given by $(\lambda^2 P/4N) - R$. This approaches

$$\left(\sin\tfrac12\sin^{-1}e^{-R}\right)^2\frac{P}{2N} - R \quad\text{as } n \to \infty.$$

Thus our asymptotic lower bound for reliability is (eliminating $\lambda$):

$$E \ge \left(\sin\tfrac12\sin^{-1}e^{-R}\right)^2\frac{P}{2N} - R. \qquad (78)$$

As $R \to 0$ the right-hand expression approaches $P/(4N)$.
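The quantities of this section are simple to evaluate. The sketch below is illustrative (the function names are assumptions for the example); it computes the guaranteed number of points $M_D$ and the right member of (78), with rates in natural units.

```python
import math

def points_guaranteed(D, n, P):
    """M_D = (sin(2 asin(D / (2 sqrt(nP)))))^-(n-1), for 0 < D <= sqrt(2nP)."""
    x = D / (2.0 * math.sqrt(n * P))
    return math.sin(2.0 * math.asin(x)) ** (-(n - 1))

def exhaustion_exponent(R, P, N):
    """Right member of (78): sin^2(asin(e^-R) / 2) * P / (2N) - R."""
    return math.sin(0.5 * math.asin(math.exp(-R))) ** 2 * P / (2.0 * N) - R

print(points_guaranteed(1.0, 10, 1.0))        # many points at small separation
print(exhaustion_exponent(0.0, 3.0, 1.0))     # equals P/(4N) at zero rate
```

At the largest admissible separation $D = \sqrt{2nP}$ the guarantee degenerates to a single point, while at zero rate the exponent equals $P/(4N)$, in agreement with the remark following (78).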

This lower bound on the exponent is plotted in the curves in Section 
XIV and it may be seen to give more information at low rates than the 
random code bound. It is possible, however, to improve the random 
coding procedure by what we have called an “expurgating’’ process. It 
then becomes the equal of the bound just derived and, in fact, is some- 
what stronger over part of the range. We shall not go into this process 
in detail but only mention that the expurgating process consists of 
eliminating from the random code ensemble points which have too close 
neighbors, and working with the codes that then remain. 


XII. LOWER BOUND ON $P_e$ IN GAUSSIAN CHANNEL BY MINIMUM DISTANCE ARGUMENT


In a code of length $n$ with $M$ code words, let $m_{is}$ ($i = 1, 2, \cdots, M$; $s = 1, 2, \cdots, n$) be the $s$th coordinate of code word $i$. We are here assuming an average power limitation $P$, so that

$$\frac{1}{nM}\sum_{i,s}m_{is}^2 \le P. \qquad (79)$$


We also assume an independent gaussian noise of power $N$ added to each coordinate.

We now calculate the average squared distance between all the $M(M-1)/2$ pairs of points in $n$-space corresponding to the $M$ code words. The squared distance from word $i$ to word $j$ is

$$\sum_s (m_{is} - m_{js})^2.$$

The average $D^2$ between all pairs will then be

$$\overline{D^2} = \frac{1}{M(M-1)}\sum_{i,j,s}(m_{is} - m_{js})^2.$$

Note that each distance is counted twice in the sum and also that the extraneous terms included in the sum, where $i = j$, contribute zero to it. Squaring the terms in the sum,

$$\overline{D^2} = \frac{1}{M(M-1)}\left[\sum_{i,j,s}m_{is}^2 - 2\sum_{i,j,s}m_{is}m_{js} + \sum_{i,j,s}m_{js}^2\right]$$

$$= \frac{1}{M(M-1)}\left[2M\sum_{i,s}m_{is}^2 - 2\sum_s\left(\sum_i m_{is}\right)^2\right]$$

$$\le \frac{1}{M(M-1)}\,2M\,nMP = \frac{2nMP}{M-1}, \qquad (80)$$

where we obtain the third line by using the inequality on the average power (79) and by noting that the second term is necessarily nonpositive.
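Inequality (80) can be illustrated on a randomly drawn code. In the sketch below, which is an illustration only, the code words are gaussian vectors rescaled so that the average power constraint (79) holds with equality; the helper names, the seed and the sample sizes are assumptions for the example.

```python
import itertools
import math
import random

def random_code(M, n, P, seed=0):
    """M random words of length n, scaled so that (1/(nM)) * sum m_is^2 = P."""
    rng = random.Random(seed)
    code = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(M)]
    energy = sum(x * x for word in code for x in word)
    scale = math.sqrt(n * M * P / energy)
    return [[scale * x for x in word] for word in code]

def pairwise_squared_distances(code):
    """Squared Euclidean distance for each unordered pair of code words."""
    return [sum((a - b) ** 2 for a, b in zip(u, v))
            for u, v in itertools.combinations(code, 2)]

M, n, P = 8, 5, 2.0
d2 = pairwise_squared_distances(random_code(M, n, P))
average = sum(d2) / len(d2)
print(average, 2.0 * n * M * P / (M - 1))
```

Since the minimum of the squared distances cannot exceed their average, some pair of words is no farther apart than $\sqrt{2nMP/(M-1)}$, which is the fact used in the argument that follows.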

If the average squared distance between pairs of points is 


<(2nMP)/(M — 1), 


there must exist a pair of points for whose distance this inequality holds. 
Each point in this pair is used 1/M of the time. The best detection for 
separating this pair (if no other points were present) would be by a 
hyperplane normal to and bisecting the joining line segment. Either 
point would then give rise to a probability of error equal to that of the 
noise carrying a point half this distance or more in a specified direction. 
We obtain, then, a contribution to the probability of error at least

$$\frac{1}{M} \Pr\left[ \text{noise in a certain direction} \ge \frac{1}{2} \sqrt{\frac{2nMP}{M - 1}} \right] = \frac{1}{M}\, \Phi\!\left[ -\sqrt{\frac{nMP}{(M - 1)\,2N}} \right].$$

This we may assign to the first of the two points in question, and the
errors we have counted are those when this message is sent and is
received closer to the second message (and should therefore be detected
as the second or some other message).

Now delete this first message from the set of code points and consider
the remaining M - 1 points. By the same argument there must exist
among these a pair whose distance is less than or equal to

$$\sqrt{\frac{2nP(M - 1)}{M - 2}}.$$

This pair leads to a contribution to probability of error, due to the first
of these being displaced until nearer the second, of an amount

$$\frac{1}{M}\, \Phi\!\left[ -\sqrt{\frac{(M - 1)\,nP}{(M - 2)\,2N}} \right].$$

This same argument is continued, deleting points and adding contributions
to the error, until only two points are left. Thus we obtain a lower
bound on $P_{e\,\mathrm{opt}}$ as follows:

$$P_{e\,\mathrm{opt}} \ge \frac{1}{M} \left[ \Phi\!\left( -\sqrt{\frac{nP}{2N}\,\frac{M}{M - 1}} \right) + \Phi\!\left( -\sqrt{\frac{nP}{2N}\,\frac{M - 1}{M - 2}} \right) + \cdots + \Phi\!\left( -\sqrt{\frac{nP}{2N}\,\frac{2}{1}} \right) \right]. \tag{81}$$

To simplify this bound somewhat, one may take only the first M/2 terms
[or (M + 1)/2 if M is odd]. Since they are decreasing, each term would
be reduced by replacing it with the last term taken. Thus we may reduce
the bound by these operations and obtain

$$P_{e\,\mathrm{opt}} \ge \frac{1}{2}\, \Phi\!\left( -\sqrt{\frac{M}{M - 2}\,\frac{nP}{2N}} \right). \tag{82}$$

For any rate R > 0, as n increases the term M/(M - 2) approaches 1
and the bound, then, behaves about as

$$\frac{1}{2}\, \Phi\!\left( -\sqrt{\frac{nP}{2N}} \right).$$

This is asymptotic to

$$\frac{1}{2 \sqrt{\pi n P / N}}\; e^{-nP/(4N)}.$$


It follows that the reliability $E \le P/(4N) = A^2/4$. This is the same
value as the lower bound for E when $R \to 0$.
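
The three forms of the minimum-distance bound can be compared numerically. The sketch below is illustrative only: it assumes $\Phi$ is the standard normal distribution function, writes `snr` for P/N, and uses hypothetical function names and sample values of M and n that do not come from the paper.

```python
import math

def Phi(x):
    """Standard normal distribution function (assumed meaning of Phi)."""
    return 0.5 * math.erfc(-x / math.sqrt(2.0))

def bound_81(M, n, snr):
    """(1/M) * sum of Phi(-sqrt(n*snr/2 * k/(k-1))) for k = M, M-1, ..., 2."""
    return sum(Phi(-math.sqrt(0.5 * n * snr * k / (k - 1)))
               for k in range(2, M + 1)) / M

def bound_82(M, n, snr):
    """Simplified bound (82); requires M > 2."""
    return 0.5 * Phi(-math.sqrt(M / (M - 2) * 0.5 * n * snr))

def asymptotic_form(n, snr):
    """exp(-n*snr/4) / (2*sqrt(pi*n*snr)), the large-n form of (1/2)*Phi(-sqrt(n*snr/2))."""
    return math.exp(-0.25 * n * snr) / (2.0 * math.sqrt(math.pi * n * snr))

M, n, snr = 64, 20, 1.0  # illustrative values only
print(bound_81(M, n, snr), bound_82(M, n, snr))

# Ratio of the limiting bound to its asymptotic form at n*snr = 200
ratio = 0.5 * Phi(-math.sqrt(0.5 * 200 * snr)) / asymptotic_form(200, snr)
print(ratio)
```

The exponent $-nP/(4N)$ in the asymptotic form is what gives the reliability bound $E \le A^2/4$; the ratio printed last tends to 1 as n grows.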


XIII. ERROR BOUNDS AND OTHER CONDITIONS ON THE SIGNAL POINTS


Up to now we have (except in the last section) assumed that all signal
points were required to lie on the surface of the sphere, i.e., at
distance $\sqrt{nP}$ from the origin. Consider now the problem of estimating
$P'_{e\,\mathrm{opt}}(M, n, \sqrt{P/N})$, where the signal points are only required to lie
on or within the spherical surface. Clearly, since this relaxes the conditions
on the code, it can only improve, i.e., decrease the probability of
error for the best code. Thus $P'_{e\,\mathrm{opt}} \le P_{e\,\mathrm{opt}}$.

On the other hand, we will show that

$$P'_{e\,\mathrm{opt}}\!\left(M, n, \sqrt{P/N}\right) \ge P_{e\,\mathrm{opt}}\!\left(M, n + 1, \sqrt{P/N}\right). \tag{83}$$


In fact, suppose we have a code of length n, all points on or within the
n sphere. To each code word add a further coordinate of such value that
in the n + 1 space the point thus formed lies exactly on the n + 1 sphere
surface. If the first n coordinates of a point have values $x_1, x_2, \ldots, x_n$
with

$$\sum_{i=1}^{n} x_i^2 \le nP,$$

the added coordinate will have the value

$$x_{n+1} = \sqrt{(n + 1)P - \sum_{i=1}^{n} x_i^2}.$$

This gives a derived code of the first type (all points on the n + 1
sphere surface) with M words of length n + 1 at signal-to-noise ratio
P/N. The probability of error for the given code is at least as great as
that of the derived code, since the added coordinate can only improve
the decoding process. One might, for example, decode ignoring the last
coordinate and then have the same probability of error. Using it in the
best way would, in general, improve the situation.

Fig. 7. Curves showing $E_L$ vs. $R$.

Fig. 8. Curves showing $E_L$ vs. different values of R for A = 1 and 2.

Fig. 9. Curves showing $E_L$ vs. different values of R for A = 3.

The probability of error for the derived code of length n + 1 must be
greater than or equal to that of the optimal code of the length n + 1
with all points on the surface. Consequently we have (83). Since
$P_{e\,\mathrm{opt}}(M, n, \sqrt{P/N})$ varies essentially exponentially with n when n is
large, the effect of replacing n by n + 1 is essentially that of a constant
multiplier. Thus, our upper bounds on $P_{e\,\mathrm{opt}}$ are not changed and our
lower bounds are multiplied by a quantity which does not depend much
on n when n is large. The asymptotic reliability curves consequently
will be the same. Thus the E curves we have plotted may be applied in
either case.
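
The derived-code construction used in proving (83) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the function name `lift_to_sphere`, the tolerance, and the sample word are hypothetical and are not taken from the paper.

```python
import math

def lift_to_sphere(word, P):
    """Append one coordinate so that the (n+1)-dimensional word has squared
    length exactly (n + 1) * P. The input word must satisfy sum(x^2) <= n * P."""
    n = len(word)
    energy = sum(x * x for x in word)
    if energy > n * P * (1 + 1e-12):
        raise ValueError("word lies outside the n sphere of squared radius n*P")
    # (n + 1) * P - energy is at least about P here, so the square root is real.
    extra = math.sqrt((n + 1) * P - energy)
    return list(word) + [extra]

word = [0.5, -0.25, 1.0]          # hypothetical code word, n = 3
lifted = lift_to_sphere(word, P=1.0)
print(sum(x * x for x in lifted))  # squared length of the derived word
```

Because the first n coordinates are unchanged, a decoder that ignores the added coordinate behaves exactly as before, which is the observation the argument rests on.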


Now consider the third type of condition on the points, namely, that
the average squared distance from the origin of the set of points be less
than or equal to nP. This again is a weakening of the previous conditions
and hence the optimal probability of error, $P''_{e\,\mathrm{opt}}$, is less than or equal
to that of the previous cases:

$$P''_{e\,\mathrm{opt}}\!\left(M, n, \sqrt{P/N}\right) \le P'_{e\,\mathrm{opt}}\!\left(M, n, \sqrt{P/N}\right) \le P_{e\,\mathrm{opt}}\!\left(M, n, \sqrt{P/N}\right). \tag{84}$$

Fig. 10. Curves showing $E_L$ vs. different values of R for A = 4.


Our upper bounds on probability of error (and, consequently, lower 
bounds on reliability) can be used as they stand. 

Lower bounds on $P''_{e\,\mathrm{opt}}$ may be obtained as follows. If we have M
points whose mean square distance from the origin does not exceed nP,
then for any $\alpha$ ($0 < \alpha \le 1$) at least $\alpha M$ of the points are within a sphere
of squared radius $nP/(1 - \alpha)$. [For, if more than $(1 - \alpha)M$ of them
were outside the sphere, these alone would contribute more than

$$(1 - \alpha)M \cdot \frac{nP}{1 - \alpha}$$

to the total squared distance, and the mean would then necessarily be
greater than nP.] Given an optimal code under the third condition, we
can construct from it, by taking $\alpha M$ points within the sphere of radius
$\sqrt{nP/(1 - \alpha)}$, a code satisfying the second condition with this smaller
number of points and larger radius. The probability of error for the new
code cannot exceed $1/\alpha$ times that of the original code. (Each new code
word is used $1/\alpha$ times as much; when used, its probability of error is at
least as good as previously.) Thus:

$$P''_{e\,\mathrm{opt}}\!\left(M, n, \sqrt{\frac{P}{N}}\right) \ge \alpha\, P'_{e\,\mathrm{opt}}\!\left(\alpha M, n, \sqrt{\frac{P}{(1 - \alpha)N}}\right) \ge \alpha\, P_{e\,\mathrm{opt}}\!\left(\alpha M, n + 1, \sqrt{\frac{P}{(1 - \alpha)N}}\right).$$
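
The counting step in this argument (at least $\alpha M$ points lie inside the enlarged sphere) is easy to check on a toy code. The sketch below is only an illustration: the function name `points_within` and the four-word code are hypothetical, and the function assumes $0 < \alpha < 1$ so that the radius is finite.

```python
def points_within(code, P, alpha):
    """Code words inside the sphere of squared radius n*P/(1 - alpha).
    Requires 0 < alpha < 1 so that the radius is finite."""
    n = len(code[0])
    r2 = n * P / (1.0 - alpha)
    return [w for w in code if sum(x * x for x in w) <= r2]

code = [[0.0], [0.0], [0.0], [2.0]]  # toy code: n = 1, M = 4
# Take P as the mean square distance from the origin per coordinate (here 1.0).
P = sum(x * x for w in code for x in w) / (len(code) * len(code[0]))
kept = points_within(code, P, alpha=0.5)
print(len(kept), ">=", 0.5 * len(code))
```

With $\alpha = 0.5$ the sphere has squared radius 2, so the one outlying word is discarded and three of the four words remain, more than the $\alpha M = 2$ the argument guarantees.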


XIV. CURVES FOR ASYMPTOTIC BOUNDS 


Curves have been calculated to facilitate evaluation of the exponents
in these asymptotic bounds. The basic curves range over values of
A from below 1 up to 16 (including A = 1, 2, 3, 4, 8, 16). Figs. 7 through 11
give the coefficients of n, $E_L$, as functions of the rate R. Since $E_L$ strictly
is a function of $\theta$, and the relation between $\theta$ and R depends somewhat on n,
a number of slightly different R scales are required at the bottom of the curve.
This, however, was considered a better means of presenting the data
than the use of auxiliary curves to relate R and $\theta$. These same curves
give the coefficient of n in the upper bounds (the straight line part together
with the curve to the right of the straight line segment). The point
of tangency is the critical R (or critical $\theta$). In other words, the curve and
the curve plus straight line, read against the $n = \infty$ scale, give upper
and lower bounds on the reliability measure. The upper and lower bounds
on E for low R are also included in these curves. The upper bound is the
horizontal line segment running out from R = 0, $E = A^2/4$. The lower
bound is the curved line running down from this point to the tangent
line. Thus, the reliability E lies in the four-sided figure defined by these
lines to the left of $R_c$. It is equal to the curve to the right of $R_c$. Fig. 12
gives channel capacity C and the critical rate $R_c$ as functions of $\theta$. For
A very small, the $E_L(R)$ curve approaches a limiting form. In fact, if
$\theta = (\pi/2) - \epsilon$, with $\epsilon$ small, to a close approximation by obvious expansions
we find

$$E_L(R) \doteq \frac{A^2}{2} - A\epsilon + \frac{\epsilon^2}{2} \quad \text{and} \quad R \doteq \frac{\epsilon^2}{2}.$$

Eliminating $\epsilon$, we obtain

$$\frac{E_L}{A^2} \doteq \left( \frac{1}{\sqrt{2}} - \sqrt{\frac{R}{A^2}} \right)^2.$$

Fig. 13 plots $E_L(R)/A^2$ against $R/A^2$.

Fig. 11. Curves showing $E_L$ vs. different values of R for A = 8 and 16.

Fig. 12. Channel capacity, C, and critical rate, $R_c$, as functions of $\theta$.

Fig. 13. Plots of $E_L(R)/A^2$ against $R/A^2$.
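
The small-A limiting form can be checked numerically. The sketch below is hedged: it assumes the definitions $E_L(\theta) = A^2/2 - \tfrac{1}{2} A G \cos\theta - \ln(G \sin\theta)$ with $G = \tfrac{1}{2}[A\cos\theta + \sqrt{A^2\cos^2\theta + 4}]$ and the asymptotic relation $R = -\ln\sin\theta$ (natural logarithms), which belong to earlier sections of the paper not reproduced here. The function names and the sample values of A and $\epsilon$ are illustrative only.

```python
import math

def E_L(theta, A):
    """Assumed form of the exponent E_L(theta) from the earlier sections."""
    G = 0.5 * (A * math.cos(theta) + math.sqrt(A * A * math.cos(theta) ** 2 + 4.0))
    return 0.5 * A * A - 0.5 * A * G * math.cos(theta) - math.log(G * math.sin(theta))

def rate(theta):
    """Assumed asymptotic relation between rate and cone angle, in nats."""
    return -math.log(math.sin(theta))

A, eps = 0.1, 0.05                 # small A and small epsilon, illustrative
theta = math.pi / 2 - eps
exact_E, exact_R = E_L(theta, A), rate(theta)
approx_E = 0.5 * A * A - A * eps + 0.5 * eps * eps   # A^2/2 - A*eps + eps^2/2
approx_R = 0.5 * eps * eps                           # eps^2/2
print(exact_E, approx_E)
print(exact_R, approx_R)
```

Under these assumptions the two pairs of values agree to within terms of higher order in A and $\epsilon$, consistent with a limiting curve that depends only on $R/A^2$.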


XV. ACKNOWLEDGMENTS 


I am grateful to several people for help in preparing this paper. Mrs.
Judy Frankman computed the curves of $E_L$ and other members of the
Center for Advanced Study in the Behavioral Sciences were helpful in
many ways. The referee made several valuable suggestions which have
been incorporated in the paper. Finally, I am particularly indebted to
my wife Betty for checking much of the algebra involved in the asymptotic
bounds.


656 THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1959





Analysis of Phonon-Drag
Thermomagnetic Effects
in n-Type Germanium*


By C. HERRING, T. H. GEBALLE and J. E. KUNZLER 


(Manuscript received November 12, 1958) 


A study has been made of the Nernst effect and the variation of thermoelec- 
tric power with magnetic field for single-crystal samples of n-type germanium 
of various orientations and impurity concentrations, at fields up to 18,000 
gauss and temperatures from 275° to 60°K and below. Except at the highest
temperatures, both effects arise predominantly from that part of the thermo- 
electric power which is due to phonon drag. All observations can be quanti- 
tatively accounted for by theory. They yield information about the dependence 
of the relaxation times for phonon-phonon scattering on the frequency of 
the phonons, and establish with some certainty the conclusion that four- 
phonon collisions are much less important than three-phonon collisions in 
the pertinent range of temperatures and phonon frequencies. Auxiliary in- 
vestigations have shown that the quantization of electron orbits in a magnetic 
field has only a small effect on transport properties when the cyclotron level
spacing is less than thermal energy. The mean free path of electrons is shown 
to be energy-independent, as acoustic-scattering theory predicts. The absolute
mobility can be predicted to within 10 per cent or better from data on the 
fractional changes of resistance with stress and magnetic field. 

A more detailed summary of the conclusions and implications of the present
work is given in Section IX.


I. INTRODUCTION 


* This paper, though complete in itself, constitutes part II of a study, part I of
which was presented in Ref. 1. In addition to describing the phenomena and the
physical principles underlying them, Ref. 1 develops the background of the present
study and gives details of the measuring techniques.

In a recent paper¹ we have presented measurements of the Nernst
effect and the change of thermoelectric power in a magnetic field for
n-type germanium of high purity. These two effects were shown to be
compounded out of electron-diffusion and phonon-drag contributions,
the latter being the predominant one in the range near liquid-air tem- 
perature. Now the phonon-drag phenomenon’? — the pushing of charge 
carriers from hot to cold by the asymmetric phonon distribution which 
a thermal gradient produces — depends on the details of both the elec- 
tron-phonon interaction and the processes which scatter phonons. The 
anisotropies of these interactions and their dependences on wave number 
enter in different ways into the several thermomagnetic quantities which 
one can measure. Thus, an analysis of thermomagnetic data should 
make it possible to sort out the different factors involved in phonon 
drag, and to obtain information not previously available about electron- 
phonon and phonon-phonon interactions. In our first paper we used 
qualitative arguments to draw a number of semiquantitative conclu- 
sions about these interactions. Our object in the present paper is to 
make these conclusions as quantitative as possible, by comparing the 
observations with explicit theoretical formulas. 


1.1 Basic Concepts 


Following our previous practice,¹ we shall formulate the theory in
terms of the Peltier tensor $\Pi$ of the electrons, which is related to the
thermoelectric power tensor Q in a magnetic field H by

$$Q_{\alpha\beta}(\mathbf{H}) = \frac{\Pi_{\beta\alpha}(-\mathbf{H})}{T}. \tag{1}$$

Here, $\Pi$ is defined as the tensor relating the energy flux F, relative to
the Fermi level, to the current density j:

$$\mathbf{F} = \Pi \cdot \mathbf{j}. \tag{2}$$

In terms of the Q-tensor, the change of thermoelectric power in the $\alpha$
direction, due to H, is

$$\Delta Q = Q_{\alpha\alpha}(\mathbf{H}) - Q_{\alpha\alpha}(0). \tag{3}$$

The Nernst coefficient B(H) is defined (for the symmetrical orientations
considered here) by

$$\mathbf{E}_t = -B\, \mathbf{H} \times \nabla T, \tag{4}$$

where $\mathbf{E}_t$ is the open-circuit field transverse to the temperature gradient.
We shall obtain B from (1) and the relation

$$BH = -Q_{yx}, \tag{5}$$

which applies if $\nabla T$ is in the x-direction and H in the z-direction.

The model which we shall use as a basis for our calculations is the
“electron-group” model described in our previous paper.¹ In this, the
possible states of motion of the charge carriers are divided into a number
of groups, g, and the assumption is made that for each such group
the distribution function over the group is determined by the contribution,
$\mathbf{j}_g$, which the states of this group make to the current density.
Carriers of all groups are, of course, subject to the same electric and
magnetic fields.

For a multivalley semiconductor like n germanium, a natural choice
is to let a group g consist of the states in an ellipsoidal shell of the Brillouin
zone, of energy range $\epsilon$ to $\epsilon + d\epsilon$, in a particular valley. Each such
group g has its characteristic Peltier tensor $\Pi_g$; this must be independent
of H if, as we are assuming, H affects the $\mathbf{j}_g$'s of the different groups
but not the distribution function for a given $\mathbf{j}_g$. The total Peltier tensor
of the medium can be compounded out of those of the different groups.
Explicitly,

$$\Pi(\mathbf{H}) = \sum_g \Pi_g \cdot \sigma_g(\mathbf{H}) \cdot \rho(\mathbf{H}), \tag{6}$$

where $\sigma_g$ is the conductivity tensor of group g and $\rho = (\sum_g \sigma_g)^{-1}$ is the
total resistivity tensor.
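
The compounding rule (6) is a conductivity-weighted average of the group Peltier tensors. The following Python sketch illustrates it with 2 x 2 tensors in the plane normal to H. All numerical values, the restriction to two dimensions, and the helper names are assumptions chosen for illustration; they are not data from the paper.

```python
def mat_mul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def mat_add(a, b):
    """Sum of two 2x2 matrices."""
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]

def mat_inv(a):
    """Inverse of a 2x2 matrix."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det],
            [-a[1][0] / det, a[0][0] / det]]

def total_peltier(pi_groups, sigma_groups):
    """Pi = sum_g Pi_g . sigma_g . rho, with rho = (sum_g sigma_g)^(-1), as in (6)."""
    sigma_total = [[0.0, 0.0], [0.0, 0.0]]
    weighted = [[0.0, 0.0], [0.0, 0.0]]
    for pi_g, sigma_g in zip(pi_groups, sigma_groups):
        sigma_total = mat_add(sigma_total, sigma_g)
        weighted = mat_add(weighted, mat_mul(pi_g, sigma_g))
    return mat_mul(weighted, mat_inv(sigma_total))

# Two hypothetical groups in a transverse field: the antisymmetric
# off-diagonal conductivity terms play the role of the Hall terms.
sigma = [[[2.0, 0.5], [-0.5, 2.0]], [[1.0, 0.8], [-0.8, 1.0]]]
pi_same = [[[3.0, 0.0], [0.0, 3.0]], [[3.0, 0.0], [0.0, 3.0]]]
pi_diff = [[[3.0, 0.0], [0.0, 3.0]], [[1.0, 0.0], [0.0, 1.0]]]
same = total_peltier(pi_same, sigma)
diff = total_peltier(pi_diff, sigma)
print(same)
print(diff)
```

When every group has the same $\Pi_g$ the total tensor reduces to that common value and is independent of the field. Off-diagonal components of $\Pi$, and hence by (1) and (5) a Nernst field, appear only when the groups differ, which is why these effects carry information about how $\Pi_g$ varies from group to group.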

For any one of the ellipsoidal shells just mentioned, the Peltier tensor
$\Pi_g$ must have the symmetry of the valley, which for n germanium is
axial symmetry about a [111]-type direction. Thus, $\Pi_g$ is describable in
terms of its two principal components, $\Pi_\parallel(\epsilon)$, along this direction (the
high-mass direction), and $\Pi_\perp(\epsilon)$, normal to it. Each of these components,
in turn, is a sum of an electron-diffusion term (subscript e) and a phonon-drag
term (subscript p). Here, $\Pi_{e\parallel} = \Pi_{e\perp}$ and is a linear function of $\epsilon$,
while $\Pi_{p\parallel}(\epsilon) \ne \Pi_{p\perp}(\epsilon)$. Our task is to relate these functions of energy
explicitly to the low- and high-field values of the Nernst coefficient B
and the change $\Delta Q$ in the thermoelectric power.


1.2 Qualitative Properties of $\Pi_{p\parallel}$ and $\Pi_{p\perp}$


As we formulate the detailed theory in the following sections, it will 
be helpful to keep in mind two qualitative conclusions which can be 
drawn from our earlier study.’ The first of these concerns the anisotropy 
ratio of the phonon-drag Peltier coefficient. It is that 


* 
> 1 but seer | (7) 


* 


where $m^*_\parallel$, $m^*_\perp$ are the principal effective masses and $\tau_\parallel$, $\tau_\perp$ are the
acoustic-scattering relaxation times® for current contributions in the
corresponding directions. A large anisotropy of this sort is to be expected,
since, if all the phonon modes had the same velocity and relaxation time
$\tau_{ph}$, the ratio $\Pi_{p\parallel}/\Pi_{p\perp}$ would be the ratio of the rates of crystal-momentum
loss to the lattice for unit currents in the high-mass and low-mass
directions, and this is just $m^*_\parallel \tau_\perp / m^*_\perp \tau_\parallel$.

Our second conclusion had to do with the energy dependence of II, 
and II,,. All the quantities which we have measured correspond to 
various kinds of weighted averages of II, and Il, . A typical such aver- 
age of II,, and II, , considered as a function of energy ¢, decreases as 


€ increases, at a rate distinctly slower than ¢ ’, This was shown to cor- 


respond to a frequency dependence of the relaxation time 7,, of the 
. : 1 2 

phonon modes intermediate between w and w ~ and nearer to the 

former. 


1.3 Program 


In Section II and Appendix A we shall discuss critically the adequacy
of the electron-group model, reporting several auxiliary experimental 
and theoretical investigations which bear on this question. After estab- 
lishing that the model should be quite good for pure n germanium at 
liquid-air temperature and above, we shall describe in Section III the 
procedures used for deriving the formulas for B and $\Delta Q$. The explicit
derivations will be given in Appendix B in a form applicable to any 
multivalley medium (e.g., n silicon); the formulas we need for the pres- 
ent application will be summarized in Section IV. In Section V we shall 
present the raw experimental data and discuss the corrections which 
need to be applied to them before comparing them with the theory. In 
Sections VI and VII we shall make the comparison for temperatures 
in the range 60° to 131°K, with the object of deducing from it as much 
information as possible about the functions $\Pi_{p\parallel}(\epsilon)$ and $\Pi_{p\perp}(\epsilon)$. In
Section VIII we shall discuss various further observations, such as 
the behavior of B at high fields and the thermomagnetic effects above 
200°K. These observations, which for experimental or theoretical reasons 
are less amenable to precise analysis, still are in good accord with the 
conclusions previously drawn, and allow them to be slightly extended. 
Section IX will summarize the arguments and give a few remarks on
the significance of the conclusions for the theories of electron-phonon 
and phonon-phonon interactions, topics which we hope to treat more 
fully in later publications. 


II. CRITIQUE OF THE ELECTRON-GROUP MODEL


For a multivalley semiconductor the electron-group model, described 
above and in our earlier paper,¹ is based on two assumptions: (a) that
the distribution function in any particular ellipsoidal energy shell can 
be approximated by a linear function of the crystal momentum over the 
shell and (b) that this distribution function is determined only by the 
external fields, being independent of the distribution functions of other 
shells. We must first examine critically the adequacy of this model; then 
we shall consider further specializations of it. 

The adequacy of the electron-group model depends on the answers to 
the following questions: 

i. Is it legitimate to use a transport equation in crystal-momentum 
space? 

ii. How accurately can one separate the transport problems for differ- 
ent energy shells? 

iii. How well is the distribution function in a single shell approximated
by a linear function of the crystal momentum? 

As regards i, there are two things which might undermine the validity 
of a transport equation in crystal-momentum space alone — inhomo- 
geneities and orbital quantization. Distortion of the current distribution 
by inhomogeneities in the specimen may make it necessary to consider 
the transport problem in position as well as in velocity space. This 
trivial but annoying difficulty in the interpretation of experiments will 
be discussed at length by one of the authors in another place;* it causes 
the transverse magnetoresistance to fail to saturate as $H \to \infty$, but has
very little effect on thermoelectric and thermomagnetic properties. The 
reason for the difference is simple. When a current flows, portions of the 
specimen with different carrier concentrations try to set up different 
Hall fields, and can only adjust to one another by a distortion of the 
current lines, which becomes large as $H \to \infty$. Thermoelectric and thermomagnetic
effects, on the other hand, are measured at zero average
current, and set up fields which are much less sensitive to inhomogeneities
than the Hall field is. The fields also remain finite as $H \to \infty$, whereas
the Hall field does not. For this reason, and because measurements on 


different pairs of side-arms showed the gross homogeneity of our speci- 


mens to be quite good, we believe the effect of inhomogeneity on our 
results to be negligible. 

A less trivial objection to the use of a transport equation in crystal-momentum
space is the fact of orbital quantization, which is known to
make the high-field magnetoresistance qualitatively different from the
predictions of a transport theory formulated in crystal-momentum
space.’ It can be argued theoretically that this is not serious until the
spacing, $\hbar\omega_c$, of the cyclotron levels becomes comparable with kT; however,
for n germanium with 18,000 gauss along a [100] axis, $\hbar\omega_c$ corresponds
to kT at about 25°K, and one may wonder whether this is negligible
for experiments at 77°. Fortunately, theoretical and empirical
evidence, which we have gathered together in Appendix A, indicates 
that for our specimens the effect of orbital quantization on the resistance 
and thermoelectric power is probably not more than 5 per cent or so at 
18,000 gauss and 77°, and that for many orientations it is probably con- 
siderably less. The main arguments can be summarized as follows: 

1. The theory of longitudinal magnetoresistance,’ when interpreted
with allowance for the energies of the phonons which do the scattering,”
predicts only rather small departures from the unquantized theory as
long as $\hbar\omega_c < kT$. Observations” support this conclusion.

2. Both for longitudinal and transverse cases the phonon-drag part
$\Pi_p$ of the Peltier coefficient is proportional to the product of the resistivity,
the average relaxation time of the phonon modes which scatter the
electrons, and the average squared velocity of these modes. As the effect 
of orbital quantization on the average relaxation time and velocity is 
probably rather less than on the resistivity, the quantization effect should 
be of similar magnitude for electrical and for thermoelectric measure- 
ments. 

3. Although the theory for transverse magnetoresistance is more com- 
plicated than for longitudinal magnetoresistance, a combination of rea- 
sonable arguments with empirical evidence yields a rough upper bound 
for the orbital quantization effect. Contradictory approaches to the 
theory have been elaborated for the case of acoustic scattering by 
Klinger and Voronyuk,’* Argyres” and Wolff; the resulting formulas 
can be evaluated for high fields ($\hbar\omega_c \gg kT$), but become very cumbersome
at lower fields. A plausible presumption is that for constant carrier
concentration and in the limit of small scattering (Hall angle practically 
$\pi/2$) the departure from the resistivity given by the older transport
theory varies at least as fast as $\hbar\omega_c/kT$, as long as this quantity is rather
less than unity. The size of the factor of proportionality cannot be esti- 


mated reliably from the nonsaturation of transverse magnetoresistance, 


* These authors evaluate their expressions only for the limiting case of fields 
so large that the energy of an average phonon causing transitions of the electrons 
is > kT. However, it is easy to show that, for the case $\hbar\omega_c/kT > 1$ and phonon
energies negligible, this approach yields a magnetoresistance that goes very nearly 
as H*, (Note added in proof: We are indebted to E. N. Adams and T. Holstein 
for communicating to us an as-yet-unpublished analysis that shows the approach 
of Ref. 9 to be the correct one.) 
since this can be shown to arise in large part from other effects. However,
the factor can be estimated from thermoelectric data at liquid-hydrogen 
and intermediate temperatures. 

4. The fair agreement of theory and experiment on high-field trans- 
verse magnetoresistance would be spoiled if orbital quantization in- 
creased the resistivity by any sizable amount. The resulting discrepancy 
would be hard to explain away, as most perturbing effects (inhomoge- 
neities, surface conduction, etc.) also increase the magnetoresistance. 

Turning to question ii at the beginning of this section, we must con- 
sider scattering processes which change the energy. In highly doped ma- 
terial the most important processes of this sort are electron-electron col- 
lisions. There is also another effect: at carrier concentrations high enough 
to make the low-frequency phonons have an effective drift velocity comparable
with that of the electrons (the so-called saturation effect* on
$\Pi_p$), the contribution of any electron group to $\Pi$ is reduced by the unbalance
of the distribution function of the phonons with which it interacts.
Since this unbalance is influenced by the interactions of these
phonons with all other electron groups, it is no longer possible at such
concentrations to separate $\Pi \cdot \mathbf{j}$ into a sum of $\Pi_g \cdot \mathbf{j}_g$. At low carrier
concentrations, however, both these effects become negligible, and the 
only type of energy change on scattering which must be considered is 
that due to the finiteness of the energies of the phonons. Only acoustical 
phonons need to be investigated, since scattering by intervalley or non- 
polar optical modes occurs with almost equal probability to all final 
states in an energy shell,” and therefore is describable by a relaxation 
time dependent only on the initial state. 

The most detailed study which has appeared on the effect of acoustic- 
phonon energies is that of Dorn.’* He estimated the effect on the distri- 
bution function for a simple-model semiconductor in an electric field, 
using both a high-temperature expansion method and a variational 
method. He found that, as one expects, the fractional alteration in the 
mobility due to the phonons having finite rather than zero energy is 
very small in the temperature range of interest to us. The alteration is
of the order of $m^*c^2/kT$, where $m^*$ is the effective mass and c the velocity
of sound. With a reasonable average $m^*$ for n germanium, $m^*c^2/k$ is only
a small fraction of a degree. However, Dorn's expressions show that the
fractional alteration in the current of a shell of energy $\epsilon$ is of order $m^*c^2/\epsilon$
for $\epsilon < kT$ but $> m^*c^2$. Since for pure lattice scattering the low-field
magnetoresistance and $\Delta Q$ effects are dominated by the behavior of low-energy
carriers, the effects of the phonon energies may be expected to
be considerably more serious for these than for the mobility in the absence
of a magnetic field. A quantitative estimate of these effects in a
magnetic field and with anisotropic masses would be of some interest.
However, it seems likely that they are still small. The fractional alteration
in the $H^2$ term in the current of a shell should still be of order
$m^*c^2/\epsilon$, although perhaps with a slightly larger coefficient than for Dorn's
case. Since, for a shell $\epsilon$ to $\epsilon + d\epsilon$ containing dn carriers, this $H^2$ term
goes as $\epsilon^{-1/2}\,dn \propto d\epsilon$ for $\epsilon < kT$, the phonon-energy correction should
involve $\int d\epsilon/\epsilon$. Thus, we guess that

$$\text{fractional correction to magnetoresistance} \sim \frac{m^*c^2}{kT} \ln \frac{kT}{m^*c^2}. \tag{8}$$

Since the phonon-drag Peltier coefficients of the different shells do not
vary much with energy, (8) should apply to the thermoelectric power
also. For n germanium in the range near liquid-air temperature, the
value of (8) is at most a few per cent.

We can now give a prescription for answering question iii, the last of
the three posed at the start of this section. This question had to do with
the legitimacy of approximating the distribution function in each energy
shell by a linear function of crystal momentum. Since questions i and ii
have by now received favorable answers, we need only to solve the transport
equation for an energy shell under the influence of arbitrary electric
and magnetic fields, and see how well the current and Peltier-heat contributions
from the accurate solution agree with those from the electron-group
approximation. Now the latter approximation amounts to retaining
only $l = 1$ spherical harmonics in the distribution function in a new
set of variables which take the ellipsoidal energy surfaces into spheres,*
variables which we have used for other purposes in Appendix A. The
terms in the scattering operator which mix $l = 1$ with higher $l$ values
(call these $S_{1l}$) can be shown’ to affect the electric current only to the
second order, i.e., $|S_{1l}|^2$. They may, however, affect the Peltier flux to
order $S_{1l}T_l$, where $T_l$ measures the Peltier flux which would be produced
by a distribution with the $l$ value in question. Now the longitudinal
and transverse branches individually contribute quite anisotropic
scattering, but their combined effect in germanium is almost isotropic,
so the $S_{1l}$ for $l \ne 1$ are small. But since the relative contributions of the
two branches to the phonon-drag Peltier effect are not known, it is quite
conceivable that the $S_{1l}T_l$ for $l \ne 1$ are not negligible, even though the
$|S_{1l}|^2$ are. A detailed investigation of this question using deformation-potential
theory has been made for the case $l = 3$ and, to some extent,
for larger $l$, and will be reported elsewhere.’’ The result is that, for germanium,
both the $S_{1l}$ and the $T_l$ are quite small, and that the retention
of only the $l = 1$ term in the distribution function should be a very good
approximation for all conduction and thermoelectric phenomena, with
or without a magnetic field.


III. PROCEDURE FOR COMPUTING B AND $\Delta Q$


Having satisfied ourselves that the electron-group model described in
Section I is a good approximation for the cases we wish to analyze, let
us now use this model to evaluate explicitly the expressions for B and
$\Delta Q$ which were given there. In both cases the task is to evaluate (6).
This determines the Nernst coefficient B, or the Nernst field (4), via
(1) and (5). Similarly, it determines $\Delta Q$ via (1) and (3). Since each
group g consists of the set of states in the energy range $\epsilon$ to $\epsilon + d\epsilon$ in
some valley i, it is appropriate to replace the conductivity $\sigma_g$ of this
group by the infinitesimal $d\sigma^{(i)}$, and to rewrite (6) in the form

$$\Pi(\mathbf{H}) = \sum_i \left[ \int \Pi^{(i)}(\epsilon) \cdot d\sigma^{(i)}(\mathbf{H}) \right] \cdot \rho(\mathbf{H}), \tag{9}$$

where the tensor $\Pi^{(i)}(\epsilon)$ has principal components $\Pi_\parallel(\epsilon)$ and $\Pi_\perp(\epsilon)$ in
the principal-axis system of valley i, and where the integration is really
over energies $\epsilon$. The tensor $d\sigma^{(i)}(\mathbf{H})$ can be obtained by solving the
transport equation relating the isothermal current $d\mathbf{j}^{(i)}$ in the energy
shell to the electric field E producing it, and expressing the solution in
the form

$$d\mathbf{j}^{(i)} = d\sigma^{(i)} \cdot \mathbf{E}.$$


For the case of nondegenerate statistics, which we shall assume
throughout, the transport equation for $d\mathbf{j}^{(i)}$ is easily written
down, in the electron-group approximation, by equating the rates of gain and
loss of crystal momentum in the shell. Anisotropic scattering can be
taken into account by assigning to the shell a relaxation-time tensor
$\boldsymbol{\tau}$, with principal components $\tau_\parallel(\epsilon)$,
$\tau_\perp(\epsilon)$ along and normal to the axis of the valley. The
resulting transport equation, in which for simplicity we omit the valley
suffix $i$, is

$$\mathbf{m}^* \cdot \boldsymbol{\tau}^{-1} \cdot d\mathbf{j} \pm \frac{e}{c}\, d\mathbf{j} \times \mathbf{H} = e^2\, \frac{\epsilon}{\langle \epsilon \rangle}\, \mathbf{E}\, dn, \tag{10}$$

where $dn$ is the number of carriers in the shell, $\mathbf{m}^*$ is the
effective-mass tensor and $\langle \epsilon \rangle = (3kT)/2$ is the mean
energy, and where the upper sign is for electrons, the lower for holes. For
small $H$ this can be solved by iteration to give the contributions (48) and
(49) of Appendix B to the conductivity tensor $d\boldsymbol{\sigma}$. As
$H \to \infty$ the part of $\mathbf{E}$ normal to $\mathbf{H}$ becomes
interpretable as the Hall field, and the asymptotic value of the part of
$d\mathbf{j}$ normal to $\mathbf{H}$ can be determined by taking the cross
product of (10) with $\mathbf{H}$ and throwing away the first term on the left.
This part of $d\mathbf{j}$ is of order $H^{-1}$; the $H^{-1}$ part of
$d\mathbf{j}$ parallel to $\mathbf{H}$ can be determined from it by dotting (10)
with $\mathbf{H}$. Higher powers in the expansion of $d\mathbf{j}$ in $H^{-1}$
can be obtained by further iteration. The resulting contributions to
$d\boldsymbol{\sigma}$ are given in (68) through (72) of Appendix B.
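
The solution of the shell transport equation can also be sketched numerically. The Python fragment below is a minimal illustration, not the procedure of Appendix B: it assumes that (10) has the form $\mathbf{m}^* \cdot \boldsymbol{\tau}^{-1} \cdot d\mathbf{j} \pm (e/c)\, d\mathbf{j} \times \mathbf{H} = e^2 (\epsilon/\langle\epsilon\rangle)\mathbf{E}\,dn$, works in reduced units in which $e/c$, the right-hand prefactor and $m^*_\parallel/\tau_\parallel$ are all set to 1, and inverts the resulting 3 × 3 system directly. The function names and the reduced units are hypothetical choices made for this sketch.

```python
import numpy as np

def cross_matrix(h):
    """Return the matrix K such that K @ v equals the cross product h x v."""
    hx, hy, hz = h
    return np.array([[0.0, -hz,  hy],
                     [ hz, 0.0, -hx],
                     [-hy,  hx, 0.0]])

def shell_conductivity(h, w=0.06, sign=+1):
    """Reduced conductivity tensor of one energy shell (assumed form of Eq. 10).

    Reduced equation: M @ dj + sign * cross(dj, h) = E, where
    M = m* . tau^-1 = diag(w, w, 1) in the principal axes of the valley
    (third axis along the valley axis) and sign = +1 for electrons.
    """
    M = np.diag([w, w, 1.0])
    # cross(dj, h) = -cross(h, dj) = -(K @ dj), hence the minus sign below.
    A = M - sign * cross_matrix(h)
    return np.linalg.inv(A)

if __name__ == "__main__":
    direction = np.array([1.0, 1.0, 1.0]) / np.sqrt(3.0)
    for strength in (0.0, 1e-3, 1.0, 1e3):
        sigma = shell_conductivity(strength * direction)
        # Component of the shell conductivity along the field direction.
        print(strength, direction @ sigma @ direction)
```

In this reduced model the tensor for reversed field is the transpose of the tensor for the original field, the first-order low-field term is antisymmetric, and at very large fields only the component along $\mathbf{H}$ survives, in line with the properties listed in the text.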

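The resistivity tensor $\boldsymbol{\rho}(\mathbf{H})$ occurring in (9) is just
the reciprocal of $\sum_i \int d\boldsymbol{\sigma}^{(i)}(\mathbf{H})$. A few
properties of $\boldsymbol{\rho}$ and $d\boldsymbol{\sigma}^{(i)}$ are worth
listing for reference. With superscripts 0, 1, 2, ... to denote coefficients of
1, $H$, $H^2$, ... in the expansion of $\boldsymbol{\rho}$ or
$d\boldsymbol{\sigma}$ in powers of $H$, and $+1$, $\infty$, $-1$, ... to denote
coefficients of $H$, 1, $H^{-1}$, ... in the expansion in powers of $H^{-1}$,
we have, for a cubic crystal,

$$\rho^{(1)}_{\alpha\beta} = -R(0) \sum_\gamma \delta_{\alpha\beta\gamma} H_\gamma, \tag{11}$$

$$\rho^{(+1)}_{\alpha\beta} = -R(\infty) \sum_\gamma \delta_{\alpha\beta\gamma} H_\gamma, \tag{12}$$

where $R(0)$ and $R(\infty)$ are the limiting values of the Hall constant at
$H = 0$ and $H = \infty$, respectively, and where
$\delta_{\alpha\beta\gamma} = \pm 1$ when $\alpha\beta\gamma$ is an even (odd)
permutation of 123, zero otherwise. For $n$ even (odd),
$\boldsymbol{\rho}^{(n)}$ and $d\boldsymbol{\sigma}^{(n)}$ are symmetrical
(antisymmetrical). The symmetrical tensor $\boldsymbol{\rho}^{(\infty)}$ is
finite and positive definite, but the only nonvanishing component of
$d\boldsymbol{\sigma}^{(\infty)}$ is the $HH$ component.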

The procedure of Appendix B is thus to get the high- and low-field
expansions of $d\boldsymbol{\sigma}^{(i)}(\mathbf{H})$ by solving (10); to use
these, when necessary, to get the corresponding expansions of
$\boldsymbol{\rho}(\mathbf{H})$; to substitute into (9) to get the expansions
for various components of $\boldsymbol{\Pi}(\mathbf{H})$; and thence to compute
the high- and low-field $B$ and $\Delta Q$ for various orientations. The
results, containing integrals over $\Pi_\parallel(\epsilon)$ and
$\Pi_\perp(\epsilon)$, can be expressed in a great variety of forms. Although in
Appendix B expressions are obtained for $B$ and $\Delta Q$ valid for arbitrary
functional forms of $\Pi_\parallel$, $\Pi_\perp$, and for valleys of either the
[111] or [100] types in a cubic crystal, these expressions are cumbersome in
their most general forms. These forms involve Maxwellian averages of quantities
$\epsilon\, \Pi_{\parallel,\perp}\, f^{(n)}(\tau_\parallel, \tau_\perp)$, where
$f^{(n)}$ is a homogeneous function of degree $n$ in its arguments. These forms
therefore simplify greatly if the $\Pi$'s and $\tau$'s are each proportional to
some power of the energy $\epsilon$. Now $\Pi_{e\parallel} = \Pi_{e\perp}$ is
linear in $\epsilon$, while our previous analysis indicates that
$\Pi_{p\parallel}$ and $\Pi_{p\perp}$ probably vary at a rate between constancy
and $\epsilon^{-1/2}$. Moreover, for ideal acoustic scattering and negligible
phonon energies $\tau_\parallel$ and $\tau_\perp$ are
$\propto \epsilon^{-1/2}$. Thus, specialization of the formulas to power-law
dependences of the $\Pi$'s and $\tau$'s can give a fair picture of the behavior
of $\Pi_\parallel$ and $\Pi_\perp$ separately, although not as accurate a picture
as we shall need for our final analysis. We shall give such a specialization
in the next section, before discussing more complete formulas.


IV. COMPILATION OF FORMULAS 


Tables I through IV give formulas for high- and low-field $B_e$, $B_p$,
$\Delta Q_e$ and $\Delta Q_p$, as obtained by the procedure just described for
various cases corresponding to energy-independent anisotropies. The formulas
are for [111] valleys and are specialized to the cases
$\Pi_{p\parallel,\perp} \propto \epsilon^{-1/2}$ or independent of $\epsilon$,
and $\tau_{\parallel,\perp} \propto \epsilon^{-1/2}$ or independent of
$\epsilon$. Comparison of the latter two alternatives makes possible a crude
estimate of the effect of impurity scattering. Although these special cases do
not give as accurate a representation of the data as we shall ultimately wish
to use, they do give a fair representation. They are worth listing because
their comparative simplicity makes it easy to see how sensitive the various
measurable quantities are to the assumed anisotropies and energy dependences.
All the tables contain references to equations of Appendix B. Most of these
equations are more general expressions for the quantities tabulated, valid
even when the anisotropies are not energy-independent. Also, they are
applicable to multivalley band structures different from that of germanium.

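The formulas for the low-field Nernst coefficient (see also Equation
B.4 of Ref. 1) are most conveniently expressed in terms of the
dimensionless coefficients $\zeta_e$ and $\zeta_p$, defined by

$$B_e = \zeta_e \left( \frac{k}{e} \right) \left( \frac{\mu}{c} \right), \qquad B_p = \zeta_p\, Q_p \left( \frac{\mu}{c} \right), \tag{13}$$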

where, as always, $B_e$ and $B_p$ represent, respectively, the electronic and
phonon-drag contributions to $B$. Table I gives the expressions obtained
in Appendix B for $\zeta_e$ and $\zeta_p$ for the case of energy-independent
anisotropies. Note that $(1 + \zeta_p)$ is the product of a factor dependent
only on the energy variations of the $\Pi$'s and $\tau$'s by a factor dependent
only on their anisotropies. The anisotropy factor is unity when the tensor
$\mathbf{m}^* \cdot \boldsymbol{\tau}^{-1}$ is isotropic, and likewise when the
tensor $\boldsymbol{\Pi}_p$ is isotropic ($\Pi_{p\parallel} = \Pi_{p\perp}$);
the energy factor is unity if either $\tau$ or $\Pi_p$ is independent of
energy. Fig. 1 shows the dependence of the anisotropy factor in
$(1 + \zeta_p)$ on $p = \Pi_{p\parallel}/\Pi_{p\perp}$. Curves are drawn for
several values of the anisotropy $w$ of
$\mathbf{m}^* \cdot \boldsymbol{\tau}^{-1}$, chosen to encompass the ranges
likely to occur for germanium and silicon.

TABLE I

Values of the dimensionless coefficients $\zeta_e$, $\zeta_p$, related by (13)
to the electronic and phonon-drag parts of the low-field Nernst coefficient,
respectively. The expressions given apply when the ratios
$w = \tau_\parallel m^*_\perp / \tau_\perp m^*_\parallel$ and
$p = \Pi_{p\parallel}/\Pi_{p\perp}$ are independent of energy. They are valid
for any type of valleys in a cubic crystal (e.g., n silicon or n germanium).
Angular brackets represent Maxwellian averages, i.e., averages with
weight $\epsilon^{1/2} \exp(-\epsilon/kT)$. For $\tau$, one may take either
$\tau_\parallel$ or $\tau_\perp$, and, for $\Pi_p$, either $\Pi_{p\parallel}$ or
$\Pi_{p\perp}$.

                                        $1 + \zeta_p$

General formula                         $\dfrac{\langle \epsilon \Pi_p \tau^2 \rangle \langle \epsilon \tau \rangle}{\langle \epsilon \Pi_p \tau \rangle \langle \epsilon \tau^2 \rangle} \left[ \dfrac{(2 + w)(1 + w + pw)}{(1 + 2w)(2 + pw)} \right]$

Reference, Appendix B                   (57), (50)     (57), (51)

Value for:

  $\Pi_p \propto \epsilon^{-1/2}$,
  $\tau \propto \epsilon^{-1/2}$               $4/\pi$ × [as above]

  $\tau$ independent of
  $\epsilon$, any $\Pi_p(\epsilon)$             1 × [as above]

  Any $\tau(\epsilon)$, $\Pi_p$
  independent of $\epsilon$               1 × [as above]

Table II gives the part of $BH$ going as $H^{-1}$ when $H \to \infty$, for
certain special directions of $\mathbf{H}$ and $\mathbf{j}$, again as obtained
by specializing the formulas of Appendix B to the case of [111] valleys and
energy-independent anisotropies. Note that at high fields the electronic
contribution to the Nernst field is anisotropic and depends on the anisotropy
$w$ of $\mathbf{m}^* \cdot \boldsymbol{\tau}^{-1}$, whereas at low fields it
does not. Fig. 2 shows the dependence of the phonon-drag contribution on
$p = \Pi_{p\parallel}/\Pi_{p\perp}$, for two assumptions about the energy
dependence of $\Pi_{p\parallel,\perp}$. Several features of the graphs and
formulas are worth noting:

i. The high-field Nernst coefficient is very much larger when $\mathbf{H}$ is
in an [011]-type direction than in an [001]-type. This could have been
anticipated from the familiar difference in rapidity of saturation of the Hall
coefficient in these two directions, and the close relationship of the
Nernst and Hall effects. The cause in both cases is the fact that, for
$\mathbf{H}$ along [011], two of the four valleys have a high-mass direction
exactly normal to $\mathbf{H}$, and so require large fields to obtain Hall
angles near $\pi/2$.

ii. The theoretical behavior of the Nernst coefficient differs from that
of the Hall coefficient in that, when $\mathbf{H}$ is along [011], the value of
$B$ is slightly different for $\nabla T$ along [100] and $[01\bar{1}]$.
Equality of Hall coefficients for these two directions of $\mathbf{j}$ is
required by the Onsager relations (principle of microscopic reversibility), but
equality of the Nernst coefficients is not. For the phonon-drag $B_p$ the
theoretical difference is very small when
$\Pi_{p\parallel} \gg \Pi_{p\perp}$, but for the electronic $B_e$ it is sizable,
e.g., a factor 1.62 for acoustic scattering and $w = 0.06$.

iii. From Tables I and II it follows that the ratio of $B_p$ to $B_e$ is not
usually the same at high fields as at low. For most specimens, therefore,
there should be a range of temperatures (near that for which
$B_e + B_p = 0$) where the sign of the Nernst voltage will change with increase
of $H$. Substitutions in the formulas give the result that, for any values of
the anisotropies in the neighborhood of those obtaining for pure germanium
($p \approx 10$, $w \approx 0.06$), the ratio $|B_p/B_e|$ is greater at high
fields than at low fields whenever $\mathbf{H}$ is along [011]. When
$\mathbf{H}$ is along [001], this ratio is less at high fields than at low if
$\Pi_{p\parallel,\perp}$ increase with decreasing energy, but becomes the same
at high and low fields if $\Pi_{p\parallel,\perp}$ are independent of energy.

Fig. 1. Dependence of the anisotropy factor in the expression of Table I for
$1 + \zeta_p$ on $p = \Pi_{p\parallel}/\Pi_{p\perp}$, for various values of
$w = m^*_\perp \tau_\parallel / m^*_\parallel \tau_\perp$.

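The electronic part of the thermoelectric power, $Q_e(\mathbf{H})$, behaves very
simply as $H \to \infty$. When $\mathbf{H}$ and $\mathbf{j}$ are parallel, the
energy distribution of the current is the same at $H = \infty$ and $H = 0$. So,
for $\mathbf{H} \parallel \nabla T$,

$$Q_e(\infty) = Q_e(0) = \mp \frac{1}{eT} \left[ \frac{\langle \epsilon^2 \tau \rangle}{\langle \epsilon \tau \rangle} + |\epsilon_b - \epsilon_F| \right], \tag{14}$$

where $|\epsilon_b - \epsilon_F|$ is the distance of the Fermi level from the
band edge, the upper sign is for electrons, the lower for holes, the angular
brackets denote Maxwellian averages, and, for the present case of
energy-independent anisotropies, $\tau$ may be taken as either $\tau_\parallel$
or $\tau_\perp$ (only the energy dependence matters). When $\mathbf{H}$ and
$\mathbf{j}$ are perpendicular, the energy distribution of the current is the
same as it would be for $H = 0$, $\tau$ constant. So, for
$\mathbf{H} \perp \nabla T$,

$$Q_e(\infty) = \mp \left[ \frac{5}{2} \left( \frac{k}{e} \right) + \frac{|\epsilon_b - \epsilon_F|}{eT} \right]. \tag{15}$$

Fig. 2. Dependence on $p = \Pi_{p\parallel}/\Pi_{p\perp}$ of the phonon-drag
part of the Nernst coefficient at high magnetic fields, for two orientations of
a cubic crystal with [111] valleys. All curves have been computed assuming
$w = m^*_\perp \tau_\parallel / m^*_\parallel \tau_\perp = 0.06$, and pure
acoustic scattering ($\tau \propto \epsilon^{-1/2}$). The full curves apply for
$\Pi_p \propto \epsilon^{-1/2}$, the dashed for $\Pi_p$ independent of energy.
The curves are fairly insensitive to the energy dependence of $\tau$, but
depend strongly on $w$, the lower curves being for small $w$ almost
proportional to $w$, the upper to $w^{1/2}$. For $\mathbf{H}$ along [011] and
$\nabla T$ along [100] (not shown) the dashed curve would coincide with that for
$[011][01\bar{1}]$, while the full curve would differ by only a few per cent
over most of the range shown.

TABLE III

Values of the saturation ratio $Q_p(H \to \infty)/Q_p(0)$, for special
directions of $\mathbf{H}$ and $\nabla T$. Assumptions and notations are the
same as for Table I except that the expressions apply only for [111] valleys
(n germanium). The energy factor
$\langle \epsilon \Pi_p \rangle \langle \epsilon \tau \rangle / \langle \epsilon \rangle \langle \epsilon \Pi_p \tau \rangle$
has the value $8/(3\pi) = 0.849$ when $\Pi_p \propto \epsilon^{-1/2}$ (with
$\tau \propto \epsilon^{-1/2}$), and the value 1 when either $\Pi_p$ or $\tau$
is independent of energy.

Directions of H, ∇T     Reference,      Value, general
                        Appendix B

[001], [001]            (78)            $\dfrac{(2 + w)(p + 2)}{3(2 + pw)}$

[011], [011]            (79)            $\dfrac{(2 + w)(1 + w + pw)}{(1 + 2w)(2 + pw)}$

[111], [111]            (80)            $\dfrac{(2 + w)(6 + p + 2pw)}{(7 + 2w)(2 + pw)}$

[001], any ⊥ H          (81)            $\dfrac{\langle \epsilon \Pi_p \rangle \langle \epsilon \tau \rangle}{\langle \epsilon \rangle \langle \epsilon \Pi_p \tau \rangle} \left[ \dfrac{(2 + w)(1 + w + pw)}{(1 + 2w)(2 + pw)} \right]$

[011], [100]            (81)            $\dfrac{\langle \epsilon \Pi_p \rangle \langle \epsilon \tau \rangle}{\langle \epsilon \rangle \langle \epsilon \Pi_p \tau \rangle} \left[ \dfrac{5 + w + p + 2pw}{3(2 + pw)} \right]$

[011], $[01\bar{1}]$    (81)            $\dfrac{\langle \epsilon \Pi_p \rangle \langle \epsilon \tau \rangle}{\langle \epsilon \rangle \langle \epsilon \Pi_p \tau \rangle} \left[ \dfrac{(2 + w)(p + 2)}{3(2 + pw)} \right]$

[011], $[1\bar{1}1]$    (81)            $\dfrac{\langle \epsilon \Pi_p \rangle \langle \epsilon \tau \rangle}{\langle \epsilon \rangle \langle \epsilon \Pi_p \tau \rangle} \left[ \dfrac{13 + 5w + 5p + 4pw}{9(2 + pw)} \right]$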


Table III gives the formulas for $Q_p(\infty)$, the saturation value of the
phonon-drag part of the thermoelectric power, again for [111] valleys
and energy-independent anisotropies. The ratio $Q_p(\infty)/Q_p(0)$ is the
product of a factor dependent on the energy variations of $\Pi_p$ and
$\mathbf{m}^* \cdot \boldsymbol{\tau}^{-1}$ by a factor dependent on their
anisotropies. Figs. 3 and 4 show how these anisotropy factors depend on
$\Pi_{p\parallel}/\Pi_{p\perp}$, again for different assumptions on the
anisotropy of $\boldsymbol{\tau}$. For longitudinal effects
($\nabla T \parallel \mathbf{H}$) the energy factor is unity; thus the ratio for
these cases gives information on the anisotropy of $\Pi_p$ independently of its
energy variation. For the transverse effects the energy factor goes to unity if
either $\Pi_p$ or $\tau$ is independent of energy.

Fig. 3. Dependence on $p = \Pi_{p\parallel}/\Pi_{p\perp}$ of the expressions of
Table III for $Q_p(\infty)/Q_p(0)$, for $\mathbf{H}$ parallel to $\nabla T$ in
various directions of a cubic crystal with [111] valleys. The full curves apply
for $w = m^*_\perp \tau_\parallel / m^*_\parallel \tau_\perp = 0.06$, the dashed
curves for $w = 0.08$. Curve labels: $\mathbf{H}, \nabla T \parallel [001]$,
$(2 + w)(p + 2)/3(2 + pw)$; $\mathbf{H}, \nabla T \parallel [111]$,
$(2 + w)(6 + p + 2pw)/(7 + 2w)(2 + pw)$;
$\mathbf{H}, \nabla T \parallel [011]$,
$(2 + w)(1 + w + pw)/(1 + 2w)(2 + pw)$.

Fig. 4. Dependence on $p = \Pi_{p\parallel}/\Pi_{p\perp}$ of the anisotropy
factors in the expressions of Table III for $Q_p(\infty)/Q_p(0)$, for various
orientations of a cubic crystal with [111] valleys, when $\mathbf{H}$ is normal
to $\nabla T$. The full curves apply for
$w = m^*_\perp \tau_\parallel / m^*_\parallel \tau_\perp = 0.06$, the dashed
curves for $w = 0.08$. Note two of the factors are the same as for Fig. 3. No
curves are given for $\mathbf{H} \parallel [011]$,
$\nabla T \parallel [1\bar{1}1]$, as this case is just one-third the way toward
$[011][100]$ from $[011][01\bar{1}]$.

Table IV, finally, gives the expressions for the low-field limits of
$\Delta Q_e/H^2$, $\Delta Q_p/H^2$. These have a less simple behavior than the
high-field limits. However, it is noteworthy that, when $\mathbf{H}$ is in the
[001] direction and $\nabla T$ in the [100], the expression for
$\Delta Q_p/Q_p$ involves the anisotropy of $\Pi_p$ only through the same
coefficient $\zeta_p$ as occurs in the expression (13) for $B_p$. The reason for
this has been elucidated in our previous paper.
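
The anisotropy factors of the saturation ratio $Q_p(\infty)/Q_p(0)$ are simple rational functions of $p$ and $w$, so their behavior is easy to explore numerically. The Python sketch below encodes the expressions as read from Table III and the labels of Fig. 3; the function names, and the assignment of each transverse expression to a particular pair of directions, are assumptions of this illustration. It also checks the remark in the caption of Fig. 4 that one transverse case lies one-third of the way between two others.

```python
def factor_long_001(p, w):
    """H and grad T both along [001]; also read here as the transverse
    factor for H along [011], grad T along [0 1 -1]."""
    return (2 + w) * (p + 2) / (3 * (2 + p * w))

def factor_long_011(p, w):
    """H and grad T both along [011]; also read here as the transverse
    factor for H along [001], grad T anywhere normal to H."""
    return (2 + w) * (1 + w + p * w) / ((1 + 2 * w) * (2 + p * w))

def factor_long_111(p, w):
    """H and grad T both along [111]."""
    return (2 + w) * (6 + p + 2 * p * w) / ((7 + 2 * w) * (2 + p * w))

def factor_trans_011_100(p, w):
    """H along [011], grad T along [100] (assumed assignment)."""
    return (5 + w + p + 2 * p * w) / (3 * (2 + p * w))

def factor_trans_011_111(p, w):
    """H along [011], grad T along a <111> direction normal to H
    (assumed assignment)."""
    return (13 + 5 * w + 5 * p + 4 * p * w) / (9 * (2 + p * w))

if __name__ == "__main__":
    for w in (0.06, 0.08):
        for p in (1.0, 5.0, 10.0, 20.0):
            print(w, p,
                  factor_long_001(p, w),
                  factor_long_011(p, w),
                  factor_long_111(p, w),
                  factor_trans_011_100(p, w),
                  factor_trans_011_111(p, w))
```

Every factor equals 1 at $p = 1$, as it must when $\boldsymbol{\Pi}_p$ is isotropic, which gives a quick consistency check on the transcription of the formulas.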


V. DATA AND CORRECTIONS 


5.1 Measurements and Extrapolations 


Measurements of electrical and thermomagnetic properties were made 
on a number of samples of n-type germanium having various orientations 
and dopings, at a number of temperatures from the liquid-hydrogen 
range to room temperature, and for magnetic fields up to 18,000 gauss. 
The samples were of the usual shape with three pairs of side-arms. Heat 
or electric current was fed in at the ends, the potential drop was meas- 
ured between a side-arm of the first pair and one of the last pair, and the 
Hall or Nernst voltage was measured between the two middle side-arms. 
A full description of the experimental procedures has been given in our
earlier paper.

Table V gives all the thermomagnetic quantities measured, as func- 
tions of magnetic field, for temperatures above 60°K; measurements were 
also made at a number of fields intermediate between those listed. Such 
data may be useful for comparison with future theoretical expressions 
for the various quantities in the intermediate-field range. In this paper, 
however, we have developed the theory only for the limiting cases of 
high and low fields, and so the most appropriate quantities for compari- 
son with theory are the high- and low-field limits; these are given in 
Table VI. The limits of the various quantities as $H \to 0$ were obtained,
since the quantities are even functions of $H$, by plotting the quantity in
question against $H^2$, fitting the low-field part of the plot to a parabola,
and noting its intercept. The average size of the random errors involved
in this procedure can be gauged from the sample plots given in Fig. 5.
It will be noted that, at low fields, $\Delta Q$ is a little less accurately
determinable than $\Delta\rho$, because of the smaller signals involved;
similarly, the errors in $B$ are larger than those in $R$. For the same reason,
the random errors in the thermomagnetic quantities become larger,
percentagewise, at higher temperatures. The extrapolations to $H = \infty$ were
made in the same way, by plotting the quantities against $1/H^2$ and fitting
with parabolas, as shown in Fig. 6. The legitimacy of assuming that $\Delta Q$
and $\Delta\rho$ are representable by power series in $1/H^2$ is borne out by
the calculations of Appendix B, within the limitations imposed by the neglect
of orbital quantization, inhomogeneities, etc. However, these latter factors
are known to prevent $\Delta\rho$ from saturating completely as
$H \to \infty$, so the extrapolations may be slightly different from the value
which it is appropriate to compare with the present theory. We shall discuss
this further below. Random errors in the high-field extrapolations are,
however, inconsequential, as Fig. 6 shows.
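
The extrapolation procedure just described can be sketched in a few lines of Python. This is only an illustration of the idea (a least-squares parabola in $H^2$ or $1/H^2$, read off at the origin); the function name, the use of NumPy's `polyfit`, the choice of field units and the synthetic data are all assumptions of the sketch and not a description of how the original plots were fitted.

```python
import numpy as np

def extrapolate(H, y, limit="zero"):
    """Fit y to a parabola in x = H**2 (limit="zero") or x = 1/H**2
    (limit="infinite") and return the fitted value at x = 0."""
    H = np.asarray(H, dtype=float)
    y = np.asarray(y, dtype=float)
    if limit == "zero":
        x = H**2
    elif limit == "infinite":
        x = 1.0 / H**2
    else:
        raise ValueError("limit must be 'zero' or 'infinite'")
    # polyfit returns coefficients from the highest power down.
    c2, c1, c0 = np.polyfit(x, y, 2)
    return c0

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    H = np.linspace(0.5, 3.0, 12)            # hypothetical fields, kilogauss
    y = 1.50 + 0.20 * H**2 - 0.01 * H**4     # synthetic even function of H
    y_noisy = y + rng.normal(scale=0.005, size=H.size)
    print(extrapolate(H, y_noisy, limit="zero"))
```

Restricting the fit to the low-field (or high-field) part of the data, as the text specifies, amounts to choosing which points are passed to the function.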

A general discussion of the sources of experimental errors and the 
probable accuracy of the data has been given in our previous paper,’ and 
will not be repeated here. However, there are some new observations 
which have an important bearing on the trustworthiness of the results, 
and in the following paragraphs we shall discuss these and some theoreti- 
cal adjustments which should be made to the data in order to make them 
comparable with the theory. We have tried to evaluate the more predict- 
able of these corrections for all the table entries that we shall use in the 
analysis in Sections VI and VII, and we have given the sum of these 
corrections in parentheses after the quantity to which they apply. 


5.2 Surface Damage 


In the attempt to track down empirically the magnitude of the effect 
of orbital quantization (an investigation which we have summarized in 
Appendix A) we encountered a stumbling-block in the high value of the 
infinite-field extrapolation of the resistivity normal to H when H is in a 
[100] direction. At 77°K, for example, the originally measured $\Delta\rho/\rho_0$ for
Sample 606 extrapolates to a value 0.5 unit above the value predicted 
by the electron-group model. This is many times larger than the ex- 
pected effect of orbital quantization as deduced from other evidence. 
However, such an increase could occur if a partial short-circuiting of the 
large Hall field were effected by surface or dislocation conduction. A 
surface contribution to the conductance of only a few parts in a thousand 
would suffice. To check this, some of the measurements on this and other 
samples were repeated after etching off about a mil of the original sand- 
blasted surface with superoxol. For the case mentioned (Sample 606 at
77°), two-thirds of the original excess of observation over theory
disappeared; the full excess reappeared on sand-blasting again. Thus, it is
clear that reliable transverse magnetoresistance results cannot be ex- 
pected with sand-blasted surfaces. 

Theoretically, one would not expect most of the other tabulated quantities
to be nearly as sensitive to surface effects as is the high-field transverse
magnetoresistance, since only in the latter case is a large Hall field
present. The high-field Nernst coefficient (see Section VIII) is the one
exception, since its approach to zero makes a small Hall field due to
longitudinal counter-currents amount to a large percentage error. The
measurements confirm the expectation that the effects on the other quantities
should be small. However, though small, the changes found in the
apparent mobility, etc., were often appreciable. The effects seem to involve
a complex interplay of reduced mobility in the damaged surface
layer and conduction with little or no Hall effect; moreover, the latter
conduction seems to be extremely anisotropic, since the large excess of
magnetoresistance was found only when there were [100] surfaces normal
to the magnetic field.

Fig. 5. Sample plots of low-field Nernst, magnetoresistance and $\Delta Q$ data
against the square of the magnetic field $H$, showing extrapolations to zero
field. Here $\Delta V_Q$ is the change in the measured thermoelectric emf in
volts between the end-arms, due to $H$, at constant temperature gradient. The
field $H$ is measured in gauss, and $B$ is in volts/degree gauss. Panel label:
T = 77.4°K.

Table VI lists, along with the original values for sand-blasted surfaces,
the values (identified by asterisks) appropriate to etched surfaces, for
all cases for which the latter were measured.





PHONON-DRAG EFFECT IN N-TYPE GERMANIUM 


5.3 Side-Arm Corrections 


The raw data given in Tables V and VI were obtained from the meas- 
urements on the assumption that the flow of heat or electric current 
down the sample was purely one-dimensional and that the side-arms 
acted merely as infinitesimal probes with which potentials could be meas- 
ured without disturbing this flow. This assumption is not quite correct, 
for the width of the side-arms (0.06 cm) was not negligible compared 
with the width (0.15 cm) of the current-carrying portion. Currents 
passed through such a sample must bulge out a little into the side-arms, 
in a way which will be altered by a magnetic field. Moreover, the aniso- 
tropy which a magnetic field introduces into the thermoelectric power 
can cause circulating currents to flow near the side-arms in the presence 
of an electric field, even though no net current flows along the specimen. 
These effects can become very serious when a large magnetic field is applied
in the direction of the side-arms. For example, in indium antimonide
magnetoresistances of several times the ideal value have been
observed for this geometry. For this reason, all the transverse measurements
reported here were made with the magnetic field normal to the
plane of the arms, and we shall show that, for this case and for all
longitudinal and zero-field cases, the side-arm errors are small and can be
approximately corrected for.

Fig. 6. Sample plots of high-field magnetoresistance and $\Delta Q$ data against
the inverse square of the magnetic field $H$, showing extrapolations to
infinite field. Notation and units are the same as for Fig. 5.

A number of statements about the effects of the side-arms can be made 
from simple theoretical considerations. For example, it is obvious that 
the current distortions have no effect on the measured values of the 
zero-field thermoelectric power or of the Hall coefficient. The conduct- 
ance of the sample is, of course, higher with the side-arms than without; 
measurements on scaled cutouts of stainless-steel sheet showed the dif- 
ference of true and apparent resistivities in the absence of a magnetic 
field to be about 2.6 per cent for our geometry. However, it can be shown 
fairly easily that the measured transverse magnetoresistance
$\Delta\rho/\rho$, and likewise the transverse $\Delta Q$, are unaffected by the
presence of the side-arms when $\mathbf{H}$ is in a [100] direction; this
invariance arises from the two-dimensional nature of the potential
distribution and the isotropy of $\boldsymbol{\rho}$ and $\mathbf{Q}$ in the
plane normal to $\mathbf{H}$. When $\mathbf{H}$ is in a [110] direction the
anisotropies of $\Delta\rho$ and $\Delta Q$ cause a difference between the
apparent and true $\Delta\rho/\rho$ and $\Delta Q$, which for transverse cases
can be expressed in terms of the anisotropies and the correction to the
zero-field resistivity. When $\mathbf{H}$ is
longitudinal, more complicated effects arise, due to the Hall fields where 
the current lines depart from longitudinality at the bases of the side- 
arms. However, for almost all the cases listed in Table VI there exist 
scaling relations which allow the side-arm corrections to be expressed, 
at least approximately, in terms of the correction to the apparent con- 
ductivity at H = 0. 

Several experiments were performed to verify that these effects were
small and to evaluate the corrections for the two cases (high-field
transverse $\Delta Q$ with $\mathbf{H}$ in a [110] direction) for which the
scaling relations do not suffice for a theoretical prediction. These
experiments consisted in removing the middle pair of side-arms of some of the
specimens and noting the changes in the apparent values of $\Delta\rho/\rho$
and $\Delta Q$. The changes due to removal of all side-arms would, of course,
be twice as great. Table VII summarizes the theoretical and experimental
results for the corrections to be added to the entries of Table VI. In no case
are these greater than about 4 per cent.


5.4 Gross Inhomogeneities 


Gradients of impurity concentrations can greatly falsify the results.
For example, the apparent Hall constant depends on the carrier concentration
between the middle side-arms, while the apparent conductivity
depends on the carrier concentration over the whole length; if the
concentration varies, the apparent Hall mobility will be incorrect. Most of
the samples were checked for homogeneity by measuring Hall constant
on the end arms as well as on the middle pair. All three measurements
usually agreed to within 1 per cent. Inhomogeneities can also cause
measurements designed to give quantities even in the magnetic field
($\Delta\rho$, $\Delta Q$) to contain contributions from effects odd in the
field ($R$, $B$), and vice versa. This effect occasionally showed up as an
asymmetry in the measured values when the direction of $\mathbf{H}$ was
reversed. This asymmetry was small for all cases listed in the tables and was
consistent with the present interpretation, e.g., the magnetoresistance
asymmetry could be calculated from the Hall inhomogeneity. The entries
represent averages of measurements for $\mathbf{H}$ and $-\mathbf{H}$.


TABLE VII

Approximate correction factors by which the entries of Table VI should
be multiplied to give the "true" quantities as they would be measured
on a specimen without side-arms. Except for the $\Delta Q$'s, the factors are
independent of temperature, or nearly so. In the last two rows, the
factors given apply in the neighborhood of liquid-air temperature.

Direction of j or VT {100} {100} 0 {110} {100}
Direction of H {001 | {O11} ) {110} | [100}

Quantity:
ras 1.026 026 | 1.026

lim (Se) 1.000 | 0.987 | 1.006 | 0. 996
H~0 p

lim o(H)

(0) 1.000 994 1.008 : .974 | 0.974
Ho Pp

B(O) 0.974 .974 0.974
.. oe

| 1.000 .975 1.009
nd HP ie

lim AQ 1.000 000 1.000

H~«


5.5 Value of the Temperature Gradient 


The measurements tabulated in Tables V and VI were all taken with
the special-purpose apparatus described in our earlier paper. With this
apparatus the temperature gradient used for the thermomagnetic effects
was not measured directly, but was computed from the thermoelectric
voltage developed between the potential leads at $H = 0$, together with
the thermoelectric power $Q$ of the specimen. The latter was computed on
the assumption that, for a given temperature, the only source of variation
of $Q$ from specimen to specimen is the variation in the height of the
Fermi level associated with the different carrier concentrations. In other
words, the $Q$ for any sample was computed from values previously measured
on other samples by adding a term equal to 86 μv/degree times the
logarithm of the ratio of carrier concentrations. This will not be correct
if impurity scattering affects $Q$ appreciably.

Previous empirical evidence suggested that, for fairly pure material,
$Q$ is almost unaffected by impurity scattering. The same conclusion follows
from a theoretical analysis which we shall carry out in Section VII.
When $T = 77$°K, it turns out that no correction is needed to the data
for Samples 603, 604, 606 and 610, on which most of the analysis of the
next two sections will be based, nor for Sample 595. The corrections
should be small for these samples at $T \approx 60$°K, and for Samples 576A
and 580 at $T = 77$°. For Sample 601 the correction begins to be perceptible
at 94°, and for Sample 596 at 87° it amounts to about 8 per cent of
the quantities tabulated.
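
The concentration scaling of $Q$ described at the start of this subsection can be written out as a one-line calculation. The Python sketch below is a hedged illustration: it works with the magnitude of $Q$, uses the natural logarithm and the value $k/e = 86.17$ μv/degree (rounded to 86 in the text), and takes the sign convention that a more heavily doped nondegenerate sample has the smaller $|Q|$. The function name and the numerical inputs are hypothetical.

```python
import math

# Boltzmann constant divided by the electronic charge, in microvolts per degree.
K_OVER_E_UV_PER_DEG = 86.17

def estimate_q_magnitude(q_ref_uv, n_ref, n_sample):
    """Estimate |Q| (microvolts/degree) of a sample from a reference sample
    measured at the same temperature.

    Assumes the only difference between the two samples is the Fermi-level
    shift due to their carrier concentrations (nondegenerate statistics,
    impurity scattering neglected).
    """
    return q_ref_uv + K_OVER_E_UV_PER_DEG * math.log(n_ref / n_sample)

if __name__ == "__main__":
    # Hypothetical numbers, for illustration only.
    print(estimate_q_magnitude(1000.0, 1.0e13, 3.0e13))
```

As the text notes, this estimate fails when impurity scattering affects $Q$ appreciably, which is why a separate correction is applied to the less pure samples.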


5.6 Boundary Scattering 


The final—and largest—correction we shall consider has to do with 
the scattering of phonons from the boundary of the specimen. We wish 
to interpret the data in terms of a theory of conduction in an infinite 
homogeneous medium. The side-arm correction has eliminated effects 
of the shape of the sample, but we have yet to correct for the finiteness 
of its dimensions. 

We may start from the observation that phonon-drag effects are due 
to the anisotropy of the low-frequency part of the phonon distribution. 
This anisotropy, due to the temperature gradient, has a certain value at 
depths below the surface that are large compared with the mean free path 
of a phonon. However, immediately below the surface the anisotropy is 
only about half as great, since that half of the phonons that comes from 


the surface has no tangential anisotropy if the surface scatters diffusely. 
Thus, one may say, approximately, that the tensor $\mathbf{Q}_p$ has one value
at depths greater than some value $\Lambda$, of the order of a phonon mean free
path, and has half this value at depths less than $\Lambda$. The effect of this
alteration is easily seen to be to make the effective $Q_p(\mathbf{H})$ less
than the interior value by the ratio

$$\frac{Q_p}{Q_p(\text{ideal})} = 1 - \frac{2\Lambda}{A^{1/2}}, \tag{16}$$

independently of $H$, where $A$ is the cross-sectional area of the sample.
The effect on the low-field Nernst coefficient can be calculated similarly;
it is


a (17) 
B, (ideal) ‘ 
where $t$ is the thickness of the sample normal to its plane. We shall take
$\Lambda$ as roughly independent of the orientation of the surface and equal, in
millimeters, to 0.037 (77°/T)**, the best value currently available from
experiments on the variation of $Q_p$ with diameter near liquid-air
temperature.

In parentheses after each entry in Table VI we have given the sum of
the side-arm correction calculated from Table VII and (for the
thermoelectric quantities) the temperature-gradient correction and the
boundary-scattering correction computed from (16) or (17). This total
correction is to be added to the table entry to get the ideal value appropriate
for the comparison with theory.
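
The size of the boundary-scattering correction to $Q_p$ can be illustrated with a short calculation. The Python sketch below assumes that (16) reads $Q_p/Q_p(\text{ideal}) = 1 - 2\Lambda/A^{1/2}$, which is what a surface layer of depth $\Lambda$ carrying half the interior $Q_p$ gives for a roughly square cross section. It uses the quoted value $\Lambda = 0.037$ mm at 77°K; the cross-sectional dimensions and the function names are hypothetical.

```python
import math

def boundary_ratio_q(mean_free_path_mm, area_mm2):
    """Q_p / Q_p(ideal) under the assumed form 1 - 2*Lambda / sqrt(A)."""
    return 1.0 - 2.0 * mean_free_path_mm / math.sqrt(area_mm2)

def ideal_from_measured(q_p_measured, mean_free_path_mm, area_mm2):
    """Undo the boundary reduction to estimate the interior (ideal) Q_p."""
    return q_p_measured / boundary_ratio_q(mean_free_path_mm, area_mm2)

if __name__ == "__main__":
    lam_77 = 0.037              # mm, value quoted in the text for 77 deg K
    area = 1.5 * 1.0            # mm^2, hypothetical cross section
    ratio = boundary_ratio_q(lam_77, area)
    print(ratio, ideal_from_measured(700.0, lam_77, area))
```

For cross sections of the order of a square millimeter the reduction comes out at several per cent, consistent with the statement that this is the largest of the corrections considered.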


VI. ANALYSIS 


At each temperature and for each impurity content we have 10 or 11
measured quantities which depend on the partial Peltier coefficients
$\Pi_\parallel(\epsilon)$, $\Pi_\perp(\epsilon)$ of the energy shells, and among
which no relations derivable from phenomenological theory exist. These are:
$Q$ and $B$ at $H = 0$, $\Delta Q(H = \infty)$ for [100], [110] and [111]
longitudinal orientations, one transverse $\Delta Q(H = \infty)$ for
$\mathbf{H} \parallel [100]$ and two such for $\mathbf{H} \parallel [110]$, and
two (or three) constants describing the low-field $\Delta Q$ behavior. (There
are three phenomenologically independent constants of the latter type,
but there is one relation between them which follows merely from the
band structure and the electron-group approximation.) We shall now
derive what information we can about $\Pi_{p\parallel}(\epsilon)$ and
$\Pi_{p\perp}(\epsilon)$ by fitting the theory of Section IV and Appendix B to
the data.

We shall first note briefly that all these data can be approximately
fitted by the specialized formulas of Tables I, III and IV, with three
adjustable constants $A_\parallel$, $A_\perp$ and $n$, where

$$\Pi_{p\parallel,\perp} = A_{\parallel,\perp} \left( \frac{\epsilon}{kT} \right)^{n}$$

TABLE VIII

Comparison of thermomagnetic data observed at 94°K with values computed
from the formulas of Tables I and III and from the mean moment
curves of Fig. 7. For the former calculation $w = 0.061$ and the ap-
proximately optimum values p = II,,/Il,1 = 9.5, @, = 697,11, « «—° 
were used. The corrections in parentheses in Table VI have been included 
in the observed values, and the data for Samples 603 and 610 have been 
corrected to 94°. The data for Sample 606 are starred to indicate that 
this sample had been etched. The units of the $\Delta Q$'s are μv/degree, those
of B(O) are 10~°v/gauss degree. Acoustic scattering has been assumed; 
correction for impurity scattering would lower the computed $B(0)$
slightly. In the first row, $Q(0) - Q^*$ is $T^{-1}$ times the Peltier heat
relative to the band edge, i.e., the sum $Q_p$ + 172 μv/degree.
Quantity | Directions of $H$ and $\nabla T$ | Sample | Value computed from Tables I and III | Observed value | Value computed from Fig. 7


()* Q(0) 869 869 863 
AQ(@ ) {O01 | {001 | 1440 1459 
1484* 


(111) 605 5: 615 
(O11| 118 124 
(001 | | 91 , 93 
(011 1325 1187 
(011 (100] 655 31: 610 
B(O) 22 34 


and with the value of $w = m^*_\perp\tau_\parallel/m^*_\parallel\tau_\perp$ given by, say, the high-field
magnetoresistance measurements. Table VIII gives, in its fifth and
sixth columns, a sample comparison of observed values with the
predictions of this simplified theory; low-field $\Delta Q$ values have been omitted
from the table because, as we shall see in detail in Section VII, they need
to be corrected for impurity scattering. The extent of the agreement is a
fairly good measure of the adequacy of the assumed behavior of $\Pi_p$,
since for most of the quantities listed the electron-diffusion contribution
is small enough so that uncertainties in its value are unimportant. The
observations are fitted well enough to engender some confidence in the
theory, but there are systematic discrepancies which suggest that $\Pi_{p\parallel}$
and $\Pi_{p\perp}$ may have different energy dependences that a less restrictive
analysis could reveal.
6.1 Analysis in Terms of Moments


As we have mentioned in Section III, the most general formulas of the
electron-group theory, as derived in Appendix B, give $Q$ or $\Delta Q$ and $B$ as
linear combinations of Maxwellian averages of quantities of the form
$\epsilon\,\Pi_{\parallel,\perp} f^{(n)}(\tau_\parallel, \tau_\perp)$, where $f^{(n)}$ is homogeneous of degree $n$. The coefficients
contain in their denominators linear combinations of similar Maxwellian
averages, without the $\Pi$'s. Now when $n$ is 0 or 1, corresponding
respectively to the high-field transverse $Q$ and to $Q(0)$, or the high-field
longitudinal $Q$, it should be quite a good approximation to take $\tau_\parallel$ and
$\tau_\perp$ both proportional to $\epsilon^{-1/2}$ in the purest of our samples (ideal acoustic
scattering). The value of $w = m^*_\perp\tau_\parallel/m^*_\parallel\tau_\perp$ appropriate to these cases
can be determined from the high-field longitudinal magnetoresistance.
However, for $n = 2$ and especially for $n = 3$ (low-field effects), the
Maxwellian averages in question become very sensitive to small amounts
of impurity scattering, and one might question the legitimacy of taking
the ratio $\tau_\parallel/\tau_\perp$ independent of energy. But since the term containing
$\tau_\parallel^{m}\tau_\perp^{n-m}$ also contains $(m^*_\parallel)^{-m}(m^*_\perp)^{-(n-m)}$, and since $\Pi_{\parallel,\perp}$ vary only slowly
with energy, it will always be a good approximation to assume that,
with angular brackets as usual denoting Maxwellian averages,

$$\left\langle \epsilon\,\Pi_{\parallel,\perp}\left(\frac{\tau_\parallel}{m^*_\parallel}\right)^{m}\left(\frac{\tau_\perp}{m^*_\perp}\right)^{n-m}\right\rangle \approx w_n^{\,m}\left\langle \epsilon\,\Pi_{\parallel,\perp}\left(\frac{\tau_\perp}{m^*_\perp}\right)^{n}\right\rangle, \qquad (18)$$

$$\left\langle \epsilon\left(\frac{\tau_\parallel}{m^*_\parallel}\right)^{m}\left(\frac{\tau_\perp}{m^*_\perp}\right)^{n-m}\right\rangle \approx w_n^{\,m}\left\langle \epsilon\left(\frac{\tau_\perp}{m^*_\perp}\right)^{n}\right\rangle, \qquad (19)$$

where, for $n = 1, 2, 3$, $w_n$ is determined from purely electrical data.
Specifically, we can define $w_n$ so that (19) is exact for $m = 1$; (18) will
also then be nearly exact for $m = 1$. For $m > 1$, (19) and (18) may be
less accurate, but this inaccuracy will have very little effect on any
electrical or thermomagnetic quantity, because the terms with $m > 1$,
being of order $w_n^{\,m}$, will be very small.
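
The reason a single factor $w^m$ appears in approximations of this kind can be seen from a short identity. The sketch below assumes, purely for illustration, that the ratio $w$ is independent of energy, so that it can be taken outside the Maxwellian average; the numerical powers use the value $w \approx 0.061$ quoted in the text.

```latex
% Assumption of this sketch: w is independent of energy within the average.
\[
  w \equiv \frac{m^*_\perp \tau_\parallel}{m^*_\parallel \tau_\perp}
  \quad\Longrightarrow\quad
  \left(\frac{\tau_\parallel}{m^*_\parallel}\right)^{m}
  \left(\frac{\tau_\perp}{m^*_\perp}\right)^{n-m}
  = w^{m}\left(\frac{\tau_\perp}{m^*_\perp}\right)^{n}.
\]
% With w = 0.061 the successive powers fall off rapidly:
\[
  w^{1} \approx 0.061, \qquad w^{2} \approx 0.0037, \qquad w^{3} \approx 0.00023 .
\]
```

Terms with $m > 1$ are therefore weighted by factors of order a few parts in a thousand or less, which is why modest inaccuracies in them matter little.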

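We shall use the approximations (18) and (19) to express the various
thermomagnetic quantities in terms of "moments" $\Pi_\parallel^{(n)}$ and $\Pi_\perp^{(n)}$, where,
for $n = 0, 1, 2, 3$,

$$\Pi_{\parallel,\perp}^{(n)} = \frac{\langle \epsilon\,\tau_\perp^{\,n}\,\Pi_{\parallel,\perp}\rangle}{\langle \epsilon\,\tau_\perp^{\,n}\rangle}. \qquad (20)$$

From the formulas of Appendix B we obtain the expressions listed in
Table IX. Since the low-field Nernst coefficient involves $\Pi_{\parallel,\perp}^{(2)}$ and
$\Pi_{\parallel,\perp}^{(1)}$, we have listed as the second entry in the first column that
combination of $B$ and $Q$ which involves only the $\Pi_{\parallel,\perp}^{(2)}$. Similarly, we have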





THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1959 


TABLE IX
Thermomagnetic quantities in terms of the moments $\Pi_{\parallel,\perp}^{(n)}$ defined by
(20), as obtained from the formulas of Appendix B using the approximations
(18) and (19) and the assumption of n-type material and [111]
valleys.


Reference, 
Appendix B 


Directions of | 


Quantity and vT | Expression 


11, + 1 (5+w) I 


T O11] [100] | 
T'Q( ) [ ca J 3 (24+ w) 


1142 

32+ w 
ea 1 ‘ 

fo11} {O1i] | 51, + 30,0 


1+w 
001] any |H ~ TI, (0) 
[001] any 1 1 +3 “Fw th + 1 + 2w es 


{001} [001] ; Ty +3 1,” 


(111) [111] | 


1 + 2w 
T+ oe WO +7 m1, 


+ Qw 


w 1+w 
[O11] [011] ‘tis 1, + E> , 


TQ (0) . - ; ~ ~ Ty += 


¥r 
B(O) 4 121 ; We iy? + = .®) 


1 + 2w2 1+2 


(H/C) 


¥ [- lim 
H-0 
AQ + (BuyH? Cc) 
(Ap p) + “4 c)? 
3 1 FR 
: i lim - ¢ ] ine : _T,4 (58), mod- 
[- H_0 ve mn bes 001} (001) . lhe 1 — w; ' ified (65) 


’ > 
+Q] foot} (100) | 5 +5 —Ta® | (58), (65) 
~ | 3 — a 


listed in the last two rows those combinations of the low-field $\Delta Q$, $B$, and
$Q$ which involve only the $\Pi_{\parallel,\perp}^{(3)}$. These equations can in most cases be
inverted to give the $\Pi_{\parallel,\perp}^{(n)}$ in terms of the observed quantities. Thus, the
first two rows just suffice to determine $\Pi_{\parallel,\perp}^{(0)}$, the next four rows
overdetermine $\Pi_{\parallel,\perp}^{(1)}$, the next row gives one relation between $\Pi_\parallel^{(2)}$ and $\Pi_\perp^{(2)}$,
and the last two rows just determine $\Pi_{\parallel,\perp}^{(3)}$.

Our procedure will now be to determine the moments $\Pi_{\parallel,\perp}^{(n)}$ from these
equations and the observational data, and then to discuss the separation
of each such empirically determined $\Pi_{\parallel,\perp}^{(n)}$ into electron-diffusion and
phonon-drag contributions and to interpret the variation of the latter
part with $n$ in terms of the energy dependence of $\Pi_{p\parallel,\perp}$.





PHONON-DRAG EFFECT IN N-TYPE GERMANIUM 


TABLE X 

Values of $w_1$, $w_2$ and $w_3$ used in the moment analysis. The values of $w_1$
were obtained independently for each case by fitting the electron-group
theory to the observed high-field limit of the longitudinal
magnetoresistance. The side-arm corrections of Table VI have been included and,
whenever possible, data obtained after etching have been used. The
latter cases are identified by asterisks. The values of $w_3$ were estimated
from these $w_1$'s (assumed for the first four samples equal to 0.061) and
the approximate dependence of $w_3$ on impurity concentration found by
Goldberg. The values of $w_2$ were interpolated between $w_1$ and $w_3$.


Empirical $w_1$        Assumed $w_2$        Assumed $w_3$


0.0625 0.064 
0.061* 
0.0615* 0.0615 0.062 
0.061* 0.061 0.061 
0.062 | 
0.061* 0.0625 0.064 
| 0.0625 0.0615 0.062 
610 4° | ~~ (0.061* 0.0625 0.064 
| 92 0.061 0.0615 0.062 
603 | 0.0625 0.064 
0.0615 0.062 


Value adopted for all 0.0610 
above cases 


601 94° 0.069 0.076 0.084 
87° 0.081 
131° 0.067* 0.0595 0.0595 
(0.0595 adopted, 
see text) 


6.2 Choice of the $w_n$


To begin with, we need values of $w_1$, $w_2$ and $w_3$. Magnetoresistance
evidence on $w$ ($= w_1$) for the present samples is summarized in the column
labeled "Empirical $w_1$" of Table X. The good agreement of the $w_1$
values from Samples 606 and 604 with that from Sample 610 speaks well
for the validity of the electron-group model. There seems to be no
significant change of $w_1$ with temperature in the range 60° to 94°K. No
reliable value of $w_1$ can be obtained from Sample 603, since in this
orientation a change of $w_1$ from 0.04 to 0.08 only changes the theoretical
high-field resistance by 0.05 of the zero-field value. The data for 603 do,
however, agree fairly well with the theory, in that the $\Delta\rho/\rho_0$ predicted
from the $w_1$ of the other specimens is within 0.02 or 0.03 of the value in
Table VI.
The empirical $w_1$ value for Sample 606 at 131°, though listed, is not to
be trusted, as a field of 18,000 gauss does not give a very good approach
to saturation. We have preferred to rely on the value 0.061 found at the
lower temperatures, decreasing it to 0.0595 to allow for the gradual
decrease of $w$ with temperature found by Goldberg, a decrease which
also shows up, though less clearly, in our own low-field data at higher
temperatures.

The estimation of $w_3$ is somewhat less satisfactory. In principle $w_3$ can
be estimated from the longitudinal and transverse low-field
magnetoresistances, since, as we noted at the start of this section, two independent
phenomenological constants should suffice for the complete description
of the low-field magnetoresistance in reasonably pure material. However,
the $w_3$ values computed in this way for Samples 603, 604 and 610 turn
out to be quite different (ranging, for example, from 0.056 to 0.073 at
77°K, all for etched samples), whereas the similarity in impurity
content between these three samples would lead one to expect similar $w$'s.
This sort of discrepancy might have been anticipated from the fact
(Ref. 1, Fig. 13) that the four data for 603 and 604 depart perceptibly
from the predictions of the phenomenological theory with two
adjustable constants. The trouble can arise from any of several causes. To
compute $w_3$ from the data, one needs the longitudinal and transverse
magnetoconductances, and the second of these equals the corresponding
magnetoresistance plus $(\mu_H/c)^2$. Now the empirical value of $\mu_H$, unlike
magnetoresistance and $\Delta Q$ values, depends on the measured thickness of
the sample and is affected by errors in this measurement. Also, it can be
falsified if the carrier concentration between the Hall arms differs slightly
from the average over the length of the sample. Since a change of $\mu_H$ by
1 per cent affects $w_3$ by 0.003, this source of scatter from specimen to
specimen could well be serious. Moreover, our random errors are large
enough to affect $w_3$ perceptibly, and comparison of etched and unetched
samples has shown that the surface damage effect can alter the empirical
$w_3$ by 0.007 or so.

Fortunately, a study of the formulas of Table IX, which we shall
describe below, shows that, once $w_3$ is specified, the $\Pi_{\parallel,\perp}^{(3)}$ computed from
the measured quantities is not nearly so sensitive to small errors in the
latter as we have just found the empirical $w_3$ to be. This suggests that
one can get a fairly reliable analysis by disregarding the apparent $w_3$'s of
the specimens and estimating the true $w_3$ for each temperature and
impurity concentration in some other way. Now for pure acoustic
scattering $w_3$ should be practically the same as $w_1$, for which we have
obtained the presumably trustworthy value 0.061 (Table X). Several
studies of the effect of impurity scattering on magnetoresistance have
been published, and, while the absolute values of $w_3$ obtained in
these studies may possibly have been affected by various kinds of
systematic errors, their picture of the variation with impurity content and
temperature is undoubtedly correct. From the work of Goldberg'’ and 
of Goldberg and Howard,” it appears that, at 77°K, a sample with im- 
purity concentration 7 X 10° em * has a w; differing from that of an 
ideally pure sample by only about 10 per cent, or at the very most 20 
per cent.† Thus it is a reasonable guess to say that the corresponding
deviation at a concentration of $2 \times 10^{13}$ cm$^{-3}$ is perhaps 5 per cent, i.e.,
0.003 in $w_3$. While the error in this estimate may be as large as the
estimate itself, it is surely much smaller than the scatter mentioned in the
preceding paragraph, and we shall see that this accuracy suffices for our
present purpose. The last column of Table X gives the values we shall
adopt for our analysis, obtained by adding an estimate of this sort to our
$w_1$. The next to the last column gives the corresponding $w_2$'s, obtained
by interpolating between $w_1$ and $w_3$.
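
The arithmetic behind the last two columns of Table X can be sketched as follows. This is only an illustration under stated assumptions: the function name is hypothetical, and taking $w_2$ as the midpoint of $w_1$ and $w_3$ is our reading of "interpolating" (the text does not give the rule). The 5 per cent rise is the rough estimate quoted above for the purest samples.

```python
def assumed_w_values(w1, fractional_rise):
    """Return (w2, w3) estimated from the empirical w1.

    fractional_rise is the estimated fractional increase of w3 over the
    value for an ideally pure sample (about 0.05 in the text).
    Taking w2 as the midpoint of w1 and w3 is an assumption of this sketch.
    """
    w3 = w1 * (1.0 + fractional_rise)
    w2 = 0.5 * (w1 + w3)
    return w2, w3


w2, w3 = assumed_w_values(0.061, 0.05)
print(f"w2 = {w2:.5f}, w3 = {w3:.5f}")
```

With $w_1 = 0.061$ the sketch gives values close to the 0.0625 and 0.064 listed in Table X for the purest samples.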


6.3 Results for $\Pi_\parallel^{(n)}$ and $\Pi_\perp^{(n)}$


It is now a straightforward matter to insert the $w_n$ of Table X into
the formulas of Table IX, equate to the empirical quantities of Table
VI (with the corrections included), and solve for the moments $\Pi_{\parallel,\perp}^{(n)}$
defined by (20). To make a comparison of the values for different
specimens more meaningful, it is best to compare not the $\Pi_{\parallel,\perp}^{(n)}$, which
depend on the location of the Fermi level through $\Pi_e$, but the quantities
$\Pi^* - \Pi_{\parallel,\perp}^{(n)}$, where $\Pi^*$, defined by (21), represents the part of $\Pi_e$ due to
the difference between the Fermi level $\epsilon_F$ and the band edge. The
quantities $\Pi^* - \Pi_{\parallel,\perp}^{(n)}$ thus are positive for n-type material and measure
moments of the Peltier heat relative to the band edge.

Fig. 7 shows the results for $\Pi^* - \Pi_\parallel^{(n)}$ and $\Pi^* - \Pi_\perp^{(n)}$, respectively,
for a number of the purest samples at several temperatures from 60° to
131°K. Here the moments for $n = 1$ were calculated from $Q$ and the
high-field limit of $\Delta Q$ for $H$ longitudinal. Those for $n = 3$ were calculated
from the longitudinal and transverse values of

$$\lim_{H \to 0} \frac{\Delta Q}{H^2}$$

† We are indebted to C. Goldberg for having informed us of some of these
results prior to publication.
Fig. 7 - Empirical values of the moments $\Pi_\parallel^{(n)}$ (left) and $\Pi_\perp^{(n)}$ (right)
defined by (20), as obtained from the method of analysis described in the text. Note
that the two scales differ by a factor of ten. The dashed curves at the bottom of
the chart for $\Pi_\perp^{(n)}$ show the contributions which $\Pi_e$ would make to $\Pi_\parallel^{(n)}$ or $\Pi_\perp^{(n)}$
at the various temperatures, if the scattering were purely acoustic. The full curves
are reference means which we shall use in Figs. 11 and 12.
using, if necessary, the phenomenological identities mentioned at the
bottom of Table IV to relate the $\Delta Q$ values for the directions going with
a particular sample to those for the directions listed in Table IX. For
$n = 2$, only one moment can be determined; since $\Pi_\perp^{(n)}$ is very
reproducible and varies very little with $n$, we have chosen to determine $\Pi_\perp^{(2)}$
by linear interpolation between $\Pi_\perp^{(1)}$ and $\Pi_\perp^{(3)}$ and then to determine
$\Pi_\parallel^{(2)}$ for each specimen by the equation specified in Table IX. For $n = 0$,
a similar procedure was followed, using a linearly extrapolated value of
$\Pi_\perp^{(0)}$; the alternative procedure of combining transverse $\Delta Q$ data on two
different samples would lead to larger random errors.
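
The interpolation and extrapolation step just described amounts to drawing a straight line in $n$ through the two directly determined perpendicular moments. A minimal sketch follows; the function name and the input numbers are hypothetical and serve only to show the arithmetic.

```python
def perp_moments_by_linear_rule(pi_perp_1, pi_perp_3):
    """Fill in the n = 0 and n = 2 perpendicular moments.

    The n = 1 and n = 3 moments are taken as given; the n = 2 value is
    linearly interpolated and the n = 0 value linearly extrapolated in n.
    """
    slope = (pi_perp_3 - pi_perp_1) / (3 - 1)
    return {n: pi_perp_1 + slope * (n - 1) for n in range(4)}


# Hypothetical inputs in arbitrary units, not data from the paper.
moments = perp_moments_by_linear_rule(100.0, 90.0)
for n, value in moments.items():
    print(n, value)
```

The parallel moments for $n = 0$ and $n = 2$ would then follow from the appropriate rows of Table IX, which this sketch does not attempt to reproduce.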

For the 131° data the method just described for determining the
moments for $n = 1$ starts to break down, since, as we noted above, the
high-field limit is not reliable. However, it is probably safe to say that
$\Delta Q(\infty)$ lies in between the extrapolated value of Table VI and a value
increased from this in the ratio of the $\Delta\rho(\infty)$ calculated for the assumed
$w_1$ to the $\Delta\rho(\infty)$ of the table. The first moments obtained from both
these limits are plotted in Fig. 7 and connected by vertical lines to em- 
phasize the uncertainty. A corresponding though smaller ambiguity re- 
sults for the second moment (II* — I’). The second and probably 
more nearly correct assumption gives the higher (II* — Tl") and the 
lower (I1I* — 11,"’). 

Fig. 7 also shows, for comparison, the electronic contribution
$\Pi^* - \Pi_e^{(n)}$, as given by an equation analogous to (20) on the assumption
$\tau_\perp \propto \epsilon^{-1/2}$, i.e., without correction for impurity scattering. The latter
correction should be very small for these specimens; some estimates of it will be
given in Section VII. It will be seen that, with our neglect of impurity
scattering, the electronic contribution is linear in $n$, and that it accounts
for the major part of the variation of $\Pi_\perp^{(n)}$ with $n$; this makes our use of
linear interpolations and extrapolations seem reasonable. Subtraction of
the electronic and total moments shows that $|\Pi_{p\perp}|$ is almost independent
of energy, decreasing slightly with increasing energy, and that $|\Pi_{p\parallel}|$ has
a more rapid, though still modest, decrease. This is a refinement of the
conclusion of our first paper to the effect that some average of $|\Pi_{p\parallel}|$ and
$|\Pi_{p\perp}|$ decreases with increasing energy. Since an accurate quantitative
estimate of the energy dependences of $\Pi_{p\parallel}$ and $\Pi_{p\perp}$ requires a correction
for impurity scattering, we shall postpone this topic to Section VII.
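
The linearity of the electronic contribution in $n$ can be checked with a few lines of arithmetic. The sketch below assumes pure acoustic scattering ($\tau \propto \epsilon^{-1/2}$) and a Maxwellian average with weight $\epsilon^{1/2} e^{-\epsilon/kT}$; both the weight and the function name are assumptions of this illustration rather than statements taken from the text. Under these assumptions the ratio $\langle\epsilon^2\tau^n\rangle/\langle\epsilon\tau^n\rangle$ reduces to a ratio of gamma functions equal to $(5 - n)kT/2$.

```python
import math

# Boltzmann constant divided by the electron charge, in microvolts per kelvin.
K_OVER_E_UV_PER_K = 86.17


def electronic_moment_over_T(n):
    """(Pi* - Pi_e^(n)) / T in microvolts per degree for tau ~ eps**-0.5.

    With weight x**0.5 * exp(-x), x = eps/kT, the average of x**q is
    Gamma(q + 1.5); here q = 2 - n/2 in the numerator and 1 - n/2 in the
    denominator.
    """
    numerator = math.gamma(2.0 - n / 2.0 + 1.5)
    denominator = math.gamma(1.0 - n / 2.0 + 1.5)
    return K_OVER_E_UV_PER_K * numerator / denominator


for n in range(4):
    print(n, round(electronic_moment_over_T(n), 1))
```

For $n = 1$ the sketch gives $2k/e$, about 172 μv/degree, which agrees with the electronic term quoted in the caption of Table VIII; successive moments differ by the constant step $k/2e$.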

The last two columns of Table VIII give a sample comparison of the 
measured thermomagnetic quantities with values predicted from the 
mean moment curves of Fig. 7. The fit is, indeed, significantly better 







than was obtained in the middle column on the assumption of energy- 
independent anisotropies. 


6.4 Accuracy of the Results 


A few words are in order regarding the sensitivity of these results to
random or possible systematic errors in the inputs. As one might infer
from the consistency of the values of $\Pi_\perp^{(1)}$ and $\Pi_\perp^{(3)}$ for different samples
and from their regular trend with temperature, these quantities are
rather insensitive. For germanium with its small $w$, Table IX shows that
the quantity $\Pi_\perp^{(1)}$ equals $TQ(0)$ minus a small fraction of the
longitudinal high-field limit of $\Delta Q$; since the latter term is only about one-fifth
of $TQ_p(0)$, errors in it have little effect. This term is almost proportional
to $w$. Thus, for example, it turns out that, at 77°K, an alteration of even
as much as 10 per cent in the assumed $w_1$, or in the high-field $\Delta Q$, would
affect the plotted quantity $\Pi^* - \Pi_\perp^{(1)}$ by only 2 per cent of its value.

The makeup of $\Pi_\perp^{(3)}$ is similar. It equals $TQ(0)$ plus a number of terms
involving the Hall mobility and the low-field $\Delta\rho$'s and $\Delta Q$'s; the latter
terms are again only a small fraction of $Q_p(0)$, and they depend hardly
at all on $w_3$. For example, for Sample 606 at 77°K, a change of the
assumed $w_3$ by 10 per cent would alter $\Pi^* - \Pi_\perp^{(3)}$ by only one-third of
1 per cent; changes of 10 per cent in any one of the quantities $B$, $\mu_H$,
or the transverse $\Delta\rho/\rho H^2$ or $\Delta Q/H^2$ would affect $\Pi^* - \Pi_\perp^{(3)}$ by 1 per
cent or less; a 10 per cent change in the longitudinal $\Delta\rho/\rho H^2$ or $\Delta Q/H^2$
would have a 3 per cent effect. The sensitivity for the other cases is
similar. Thus, the values of $\Pi_\perp^{(n)}$ should be quite reliable.

Turning to the $\Pi_\parallel^{(n)}$, we find these quantities to be much more
sensitive. The values of $\Pi_\parallel^{(1)}$, which depend primarily on the longitudinal
high-field limit of $\Delta Q$, should be more reliable than those for other values
of $n$. Thus, for example, for Sample 606 at 77°K an error of 5 per cent
in the high-field $\Delta Q$ would affect $\Pi^* - \Pi_\parallel^{(1)}$ by a little over 4 per cent.
A 10 per cent change in the assumed $w$ would affect this quantity by
only about one-half of 1 per cent. On the other hand, $\Pi_\parallel^{(3)}$ is much
more sensitive to $w_3$, being in fact very nearly proportional to $w_3^{-1}$. It
is also fairly sensitive to the [100] longitudinal $\Delta Q/H^2$ and $\Delta\rho/\rho H^2$. For
example, at 77°K a 10 per cent change in either of the latter quantities
would change $\Pi^* - \Pi_\parallel^{(3)}$ by about 9 per cent. However, the sensitivity
to the other inputs mentioned in the preceding paragraph is small; a 10
per cent change in any one of them at 77°K would affect $\Pi^* - \Pi_\parallel^{(3)}$ by
no more than 2 per cent. The quantity $\Pi^* - \Pi_\parallel^{(2)}$ behaves very similarly.
It is very nearly proportional to $w_2^{-1}$; however, the uncertainties in $w_2$
should be rather less than those in $w_3$. Changes of 10 per cent in $B$ or
$\mu_H$ would affect $\Pi^* - \Pi_\parallel^{(2)}$ at 77° by about 5 per cent, and a change of
3 per cent in the interpolated $\Pi_\perp^{(2)}$ would affect it by 6 per cent. The
values of $\Pi^* - \Pi_\parallel^{(0)}$, finally, are insensitive to $w$ when derived from
measurements with $H$ in a [110] direction. The sensitivity to the assumed
high-field transverse $\Delta Q$ is moderate; for Sample 603 at 77°, for example,
a 10 per cent change in the latter quantity would change $\Pi^* - \Pi_\parallel^{(0)}$ by
perhaps 9 per cent.


6.5 Other Orientations 


Several conceivable calculations have been omitted from the list used
in the construction of Fig. 7; these correspond to cases where the
observed quantity is very insensitive to the moment which one might wish
to calculate from it. Thus, as Fig. 3 shows, the longitudinal $\Delta Q$ in the
[011] direction is very small and insensitive to $p$ (i.e., to $\Pi_\parallel^{(1)}$). Similarly,
Fig. 4 shows that the transverse $\Delta Q$ with $H$ in an [001] direction is small
and very insensitive to $p$ (or $\Pi_\parallel^{(0)}$). Although no reliable points for Fig.
7 can be computed from these data, the entries in the last two columns
of Table VIII show that the $\Delta Q$'s for these orientations are in fair
agreement with the predictions of the theory. (It is more significant to
compare values of $Q(\infty) = Q(0) + \Delta Q$ than values of $\Delta Q$ itself, since the
latter quantity could conceivably be of either sign.) The small
discrepancies remaining are of the order of magnitude of the expected effect of
orbital quantization.

VII. DEPENDENCE OF $\Pi_{p\parallel,\perp}$ ON ENERGY


The obvious next step is to convert the empirical moments of $\Pi^* -
\Pi_{\parallel,\perp}$ plotted in Fig. 7 into moments of $-\Pi_{p\parallel,\perp}$ by subtracting the
theoretical moments of $\Pi^* - \Pi_e$. One may then hope to determine the
rate of variation of $\Pi_{p\parallel,\perp}$ with energy $\epsilon$ by comparing the way in which
these moments vary with $n$ with the $n$-variation to be expected for, say,
a simple power-law dependence of $\Pi_{p\parallel,\perp}$ on $\epsilon$. For a specimen completely
free of impurity scattering, all this could be done very simply indeed.
However, as we shall see, the second and third moments, i.e., those with
weight factors $\tau_\perp^2$ and $\tau_\perp^3$ in (20), are extremely sensitive to impurity
scattering, since this modifies $\tau_\perp$ greatly in the low-energy region, where
it is large, and since $\Pi_{p\parallel,\perp}$ also tend to be largest at low energies. Thus,
in spite of our efforts to avoid the morass of impurity-scattering theory
by using very pure specimens, we must still resort to it if we are to make
a complete analysis of our data. However, the smallness of the impurity
scattering in the samples used for Fig. 7 will turn out to simplify greatly
the task of correcting for it. We shall therefore digress for a moment to
discuss the nature of impurity scattering and to establish the validity of
the procedure we wish to use to take account of it.


7.1 Impurity Scattering 


The Conwell-Weisskopf theory of ionized-impurity scattering,
combined with various modifications, has had some success in
correlating mobility data and other properties of semiconductors.* However,
a theory of this form can be rigorously justified only in the limit
of carrier and ion densities rather lower than those normally encountered
in semiconductors; even for this case, a rigorous treatment requires
inclusion of the effects of electron-electron collisions, a refinement which
has usually been ignored in the semiconductor literature. Under
conditions such as those of our experiments, there are several aspects of the
theory which become rather dubious, e.g., use of the Born approximation
in scattering, the justification for treating the screened coulomb
potential of an ion as a static scattering potential, etc. Thus, about all one
can safely say about ionized-impurity scattering in this range is that it
is something which modifies the distribution function more and more as
the energy becomes lower. Fortunately, this fact seems to be about all
one needs to know in order to correlate the various effects of impurity
scattering with one another, in the range where impurity scattering is
slight. We shall now try to demonstrate this empirically.

Although the fact of electron-electron scattering prevents a
relaxation-time model from being strictly valid for impurity scattering, we shall
follow custom and assume that, in the range of interest, the effects of
impurity scattering can be adequately allowed for by modifying the form
of the functions $\tau_\parallel(\epsilon)$, $\tau_\perp(\epsilon)$. The usual assumption takes the form

$$\frac{1}{\tau} = a\epsilon^{1/2} + b\epsilon^{-3/2}, \qquad (22)$$

where the first term represents acoustic scattering, and the second
impurity scattering. This leads to difficult integrals in the various transport
expressions. Since no real justification can be given for the exact form of
(22), except at densities far below ours, we shall try using instead
another expression, which has the same property of reducing $\tau$ at low


* For a general review with further references, see Ref. 26. 
energies but leads to more tractable integrals, namely,


TX afi _ a (Z)]. (23) 


Here @ is a parameter which measures the amount of impurity scatter- 
ing. 
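
Averages of the kind that enter these transport expressions are easy to evaluate numerically today. The sketch below is an illustration only: it assumes the conventional mixed-scattering form with an acoustic term proportional to $\epsilon^{1/2}$ and an impurity term proportional to $\epsilon^{-3/2}$ in $1/\tau$ (no logarithmic factor), an isotropic effective mass, and a Maxwellian weight $x^{1/2}e^{-x}$ with $x = \epsilon/kT$. The function names and the dimensionless parameter `r` (ratio of impurity to acoustic coefficient in units of $kT$) are hypothetical.

```python
import math


def maxwell_average(p, n, r, steps=20000, t_max=8.0):
    """Unnormalized average of x**p * tau**n with weight x**0.5 * exp(-x).

    tau is defined by 1/tau = x**0.5 + r * x**-1.5. The substitution
    x = t**2 removes the square-root behavior at the origin, and the
    midpoint rule avoids evaluating the integrand at t = 0.
    """
    h = t_max / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        x = t * t
        tau = x ** 1.5 / (x * x + r)
        total += x ** p * tau ** n * math.sqrt(x) * math.exp(-x) * 2.0 * t
    return total * h


def hall_to_drift_ratio(r):
    """<eps tau^2><eps> / <eps tau>^2 for the assumed scattering law."""
    return (maxwell_average(1, 2, r) * maxwell_average(1, 0, r)
            / maxwell_average(1, 1, r) ** 2)


def magnetoconductivity_ratio(r):
    """<eps tau^3><eps tau> / <eps tau^2>^2 for the assumed scattering law."""
    return (maxwell_average(1, 3, r) * maxwell_average(1, 1, r)
            / maxwell_average(1, 2, r) ** 2)


print(hall_to_drift_ratio(0.0), 3.0 * math.pi / 8.0)
print(magnetoconductivity_ratio(0.0), 4.0 / math.pi)
```

For `r = 0` the two ratios reduce to the familiar acoustic-scattering values $3\pi/8$ and $4/\pi$; evaluating them for small positive `r` shows how a modest admixture of impurity scattering shifts them.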


7.2 Legitimacy of (23) 


Our argument that (23) is justified will consist of two parts. The first
will be to show that (23) gives practically the same results as (22) for
a variety of problems, i.e., that when impurity scattering is just
beginning to be appreciable the exact form of the impurity-scattering law is
of less consequence than the mere fact that the $\tau$'s of low-energy
electrons are greatly reduced. The second argument will be empirical and
will consist in a rough quantitative correlation of magnetoresistance,
Hall, and mobility data, based on (23).

The curves of Fig. 8 show the first of the comparisons just mentioned.
The abscissas are values of the ratio $\mu_H/\mu_{Ha}$, where $\mu_H$ is the Hall
mobility computed with (22) or (23) for a simple-model semiconductor,
and $\mu_{Ha}$ is the corresponding quantity for pure acoustic scattering ($b = 0$
in (22), or the corresponding limit of (23)). The upper two sets of curves
show the values of the ratio of Hall to drift mobility and of the electronic
Peltier heat relative to the band edge. The full curves were computed
from (23), while the dashed ones were taken from the literature based on
(22).* Note that the agreement is very good near $\mu_H/\mu_{Ha} = 1$, but that,
as one must expect,
it becomes poor for uyz/una < 4. For the magnetoconductivity (bottom 
curve) the agreement with the dashed curve, calculated from (22) by
means of the tables of Beer, Armstrong and Greenberg, is even better.
It is worth noting, incidentally, that Mansfield has also found $\mu_H/\mu$ to
be insensitive to the form of the impurity-scattering law for $\mu_H/\mu_{Ha} >
0.7$ or so.

Although it is encouraging to find such insensitivity of the predictions
of impurity-scattering theory to the exact form of the scattering law, a
comparison of the predictions of (23) with experiment is desirable as a
test of its reliability. In applying (23) to n-type germanium we must, of
course, take account of the anisotropy of the valleys, and of possible
anisotropy in the impurity scattering. We shall do this in the same way as
was described for the moment analysis of Section VI. Thus, we shall describe

* Ref. 28 gives a general review and references to the earlier literature, Ref. 29 
an accurate table of integrals and Ref. 30 the most complete tables of Hall and 
magnetoresistance integrals. 
Fig. 8 - Tests of the predictions of the impurity-scattering assumption (23).
The full curves represent the predictions of (23), the dashed curves those of (22),
for a semiconductor with isotropic effective mass. Abscissas are ratios of Hall
mobility to its value for pure acoustic scattering. Ordinates are the ratios of the
values computed for the designated quantities with impurity scattering to the
values with pure acoustic scattering. The top curves pertain to $\Pi^* - \Pi_e = \langle\epsilon^2\tau\rangle/e\langle\epsilon\tau\rangle$,
where $\Pi^*$ is defined by (21); the acoustic value of $\Pi^* - \Pi_e$ is $2kT/e$. The
middle curves pertain to $\mu_H/\mu = \langle\epsilon\tau^2\rangle\langle\epsilon\rangle/\langle\epsilon\tau\rangle^2$; the acoustic value is $3\pi/8$. The
bottom curves pertain to the low-field transverse magnetoconductivity ratio
$(c/\mu_H H)^2(\Delta\sigma/\sigma_0) = \langle\epsilon\tau^3\rangle\langle\epsilon\tau\rangle/\langle\epsilon\tau^2\rangle^2$; the acoustic value is $4/\pi$. The points represent
empirical values for the various etched specimens as obtained from the observed
#u/u and the magnetoresistance constant b = (Ap/ /pHd*) 00%" with absciss: us based 
on the assumption; wy. = 54.3, 38.6, 28.0, and 16.2 & 10% cm?/vs at 63°, 77°, 94°, 
and 131°K, respectively; both ordinates and abscissas represent ratios of 'sbeueved 
quantities to values which would obtain if the w; , w2 and w; of Table X were com 
bined with the acoustic value of ra . The ratios of uyz/u and of b to their ideal 
values are rather insensitive to the anisotropy of m*/r when this is allowed for in 
this manner. 
the anisotropy by the $w_1$, $w_2$ and $w_3$ values given in Table X, and shall
take the transverse magnetoconductivity of a [100] specimen to be given
by


wt Ao 
o(unll/c) 


(24) 


Ap “ep E + ws3)(1 + 2w;)(2 + = (ers (ers) 
p(unll/c)? : 3(1 + 2w.)? (er ,?)” ‘ 


an expression which follows from the known forms for $\Delta\sigma$, $\sigma$ and $\mu_H$ in
the electron-group approximation, and the further assumption (19).
Similarly, we shall take for the Hall and drift mobilities


r(1 + 2w») (er.’) (25 
MH & : ’ ; 20) 
le + Ww) (€T ,) 
(er, ) 
be 2+ w) | "|, (26) 
(é€) 


so that the ratio of $\mu_H$ to its ideal acoustic-scattering value $\mu_{Ha}$ can be
obtained from the ratio of the right of (25) to its ideal value. If now we
assume (23) for $\tau_\perp$, and the $w_n$'s of Table X, we can use the curves of
Fig. 8, since the ratio of the magnetoconductivity or the Hall mobility
to the factor involving $w_n$'s in (24) or (25) is just the value for the
isotropic-mass case.

As we have no direct measurement of $\mu_{Ha}$ from which to compare (25)
with (24) for a single specimen, we have adjusted this one parameter to
fit the average behavior of the magnetoresistance of the purer specimens
at 77.4°K. The $\mu_{Ha}$ values for other temperatures were determined from
the assumption py, « T 1-6 this exponent having been chosen to make 
the fractional departure of uy from ya scale a little more slowly than 
T *, as one expects from the Conwell-Weisskopf law. Using the ratio of 
the observed $\mu_H$ to this $\mu_{Ha}$ and the $w_1$ and $w_2$ of Table X, we can
determine the ratio $\langle\epsilon\tau_\perp^2\rangle\langle\epsilon\tau_{\perp a}\rangle/\langle\epsilon\tau_\perp\rangle\langle\epsilon\tau_{\perp a}^2\rangle$ of the last factor in (25) to the
value it has when $\tau_\perp = \tau_{\perp a}$, the value for pure acoustic scattering. With
this ratio as abscissa, we have plotted, on the graphs of Fig. 8, the ratios
of empirical value to acoustic value for the last factor in (24) and for
the ratio of the last factors in (25) and (26), as obtained from the ratio
of the low- and high-field Hall constants. Only data from etched
specimens have been used, as extreme accuracy is required for a significant
comparison; even so, there is a fair amount of scatter in the points. Much
of this may be due to errors in $\mu_H$, resulting from errors (perhaps 1 per
cent) in the assumed dimensions, or from the fluctuations in impurity
density; such errors will displace the $\mu_H/\mu$ points horizontally and the
magnetoconductivity points along a line sloping at about 45° from
upper left to lower right. However, the curves account at least roughly for
the at first sight surprisingly low magnetoconductivity of the pure
specimens, and for the further reduction of magnetoconductivity and $\mu_H/\mu$
found for the less pure specimen (Sample 601).

In concluding this digression, a few words are in order regarding the
validity of the assumption that nothing except ionized-impurity
scattering interferes appreciably with the ideal acoustic scattering law
$\tau \propto \epsilon^{-1/2}$. Optical-mode or intervalley scattering has sometimes been suggested
as one of several possible causes of the departure of the mobility from
a $T^{-3/2}$ law. Though such scattering freezes out at low temperatures, it
is easily verified that the amount of it which would be needed to account
entirely for the mobility exponent could raise $\mu_H/\mu$ by 3 per cent or so
at 131°K, depress it by perhaps 1 per cent at 77°, and lower the absolute
value of the magnetoconductivity by 1 per cent or so. The fact that, for
Sample 606 at 131°, the ratio $\mu_H/\mu$ is 0.992 of the acoustic-scattering
value suggests that this and other departures from ideal
acoustic-scattering behavior are small.


7.3 Behavior of the Moments of $\Pi_{p\parallel,\perp}$


Fig. 9 shows the way in which the various moments of $\Pi_p$, as defined
by (20) for either component, would be affected by impurity scattering
of the type (23), if II, were « « tor € Impurity scattering, of 
course, does not affect the zeroth moment, and if $\Pi_p$ is independent of
energy it does not affect any moment. We have chosen as abscissa the
same ratio of Hall mobility to ideal acoustic mobility which was used in
Fig. 8. Fig. 10 shows, analogously, the effect of impurity scattering on the
second and third moments of the quantity $(\Pi^* - \Pi_e) \propto \epsilon$; the effect on
the first moment has already been shown in Fig. 8. Comparison of Figs.
9 and 10 with Fig. 8 shows that, for even our purest samples at 77°K,
the effects of impurity scattering on the moments with $n = 2$ and 3 can
be considerable if Il, varies as rapidly as €*. This is why we have un- 
dertaken such an elaborate discussion of these effects. 

The first step in analyzing the data of Fig. 7 is to correct II* — II, for 
the effects of impurity scattering and subtract it from the curves of Fig. 
7 to get curves representing the moments of II, and II,, , averaged 
over the purer specimens. To describe the average behavior of the purer 
specimens, we have chosen the values era’) (eraa)/(era)(€Tas') = 0.89 
at 60°, 0.93 at 77.4°, 0.96 at 94° and 0.98 at 131°. These choices, though 
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Fig. 9 — Effect of impurity scattering on the moments II, if I, is assumed 
« ¢€ '/2 (full curves) or « e'/* (dashed curves). Abscissas are the ratio of Hall 
mobility to its value for same anisotropy parameters w; and we, but ra « €!/2, 


reasonable enough, differ slightly from other equally reasonable choices 
which one might make from Fig. 8; we shall discuss the sensitivity of the 
results to the choice in the next paragraph. From these abscissas in Fig. 
10 and the top curve of Fig. 8 we have determined the corrections to 
m* — 11,'”. The resulting values of 1,.‘"’ and I1,)'" are shown in Fig. 
11 as full curves. The dashed curves are the moments to be expected for 
Ij. © €, € , ore, as determined from the values of ux/uwo just 
given and the curves of Fig. 9; the dashed curves have been fitted to the 
full curves at n = 1. 

A word about the sensitivity of the results to the choice of abscissas 
in Figs. 8 to 10 is in order. Shifting the abscissa 0.01 unit at 77° has a 
negligible effect on the moments with n = 0 or 1. It shifts the full and 
dashed curves of Fig. 11 in the same direction for n > 1, the dashed 
curves being shifted much the greater amount; for n = 3, it shifts the 
e' dashed curves by about 0.02 volt for II,), or 0.002 volt for II,. . 
At 131°K, the effect is more serious: a decrease of the assumed abscissa 
from 0.98 to 0.97, though affecting the n = 1 moments and II,,“° very 
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empirical II,, curves lie between those for II,, « ¢ and €*, and nearer 
to the former. By interpolation we estimate II,, « € *’ the exponent 
being within 0.02 or so of this value at all four temperatures. The curves 
for II,)'" have, for n > 1, roughly the behavior to be expected for 
II,y « €” (77° and 94°K), €° *® (60°) and ¢ °” (131°K); between 


n = 0 and n = 1 the slope is steeper, corresponding on the average to,

TABLE XI
Computed fractional errors in the assumption that Q is the sum of the
ideal Q_e and Q_p values which would occur in the absence of impurity
scattering. Here δ|Q| = |Q|_true − |Q|_ideal = δ|Q_e| + δ|Q_p| and
δ|Q_p| = δ_1|Q_p| + δ_2|Q_p|, where δ_1|Q_p| arises from the departure
of w_1 = m*_⊥⟨ετ_∥⟩/m*_∥⟨ετ_⊥⟩ from its acoustic-scattering value, and
δ_2|Q_p| arises from the corresponding departures of the moments of
Π_p∥ and Π_p⊥ defined by (20).
Estimated 


Abscissa for 
Figs. 8 & 9 


61 Op 
wv / deg 


606 0.91 +5 <+5 —25 0.005 

0.99 2° 0 0 0 0.000 
603 0.93 +3 <+5 —4 <+0.002 
601 0.75 ( +18 +16 —17 +0.010 
596 0.56 7 +38 +57 — 30 +-0.041 


say, «at the three lower temperatures. Considering these variations 
and the random scatter of the points in Fig. 7, we can state as our best 
estimate that II,, « ¢« °°*° at low energies and that its energy de- 
pendence probably becomes more rapid at higher energies. There is no 
reliable evidence that the energy dependence of either component 
changes appreciably in the temperature range 60° to 131°K, although 
the accuracy of the determination of energy dependence is not very good 
for II,). The ratio I,)""/Tl,."” seems to be decreasing slightly with in- 
creasing temperature, although here again it is hard to be sure that the 


effect is real. 


7.4 Effect of Impurity Scattering on the Assumed Q 


As we have mentioned in Section V, all our thermomagnetic quantities
were measured relative to the value of Q at H = 0, and the values
originally tabulated for them were based on the value of Q characteristic
of the observed carrier concentration and pure acoustic scattering. If
impurity scattering causes the total Q to depart appreciably from this
assumed value, these values will require correction. The results obtained
in this section allow us to estimate how much effect impurity scattering
has on Q_e and Q_p, and so to check this point.

According to Table IX, impurity scattering can affect Q_p in two ways:
through changing the anisotropy w (= w_1) of m*^(-1)τ, and through its
effect on the moments of Π_p∥ and Π_p⊥. The effect on Q_e is, of course,
simply obtained from the top curve of Fig. 8. Table XI shows some sample
evaluations of the two kinds of effects on Q_p and of the effect on Q_e,
computed
from Figs. 8 and 9, the w’s of Table X, and the assumptions II,, « 
e”” Tl,. « € ”. It will be noted that the change in w largely compen- 
sates the changes in II,),,°"; this explains the previously puzzling lack 
of sensitivity of Q, to impurity scattering. The fractional errors 6Q/Q in 
our assumed Q values are seen to be at most a fraction of a per cent for 
the purer specimens; this implies errors in the ordinate of Fig. 7 of rather 
less than a per cent. For Samples 601 and 596, however, at the tempera- 
tures shown, 6Q/Q is 1 per cent and 4 per cent, respectively, and 4II, 
(11* — II) is about twice as great. 
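
The bookkeeping behind Table XI can be illustrated with a short sketch.
It assumes that the three entries in microvolts per degree listed for
Sample 601 (+18, +16 and −17) are the contributions δ|Q_e|, δ_1|Q_p| and
δ_2|Q_p| in some order (their sum does not depend on the order), and that
the last entry (+0.010) is the fractional error δQ/Q. The function names
are hypothetical, and the implied |Q| is only a consistency check, not a
value quoted in the text.

```python
def total_shift(contributions_uv_per_deg):
    """Sum the separate shifts (microvolts/degree) into the total shift of |Q|."""
    return sum(contributions_uv_per_deg)


def implied_q(total_shift_uv_per_deg, fractional_error):
    """|Q| (microvolts/degree) implied by a total shift and its fractional size."""
    return total_shift_uv_per_deg / fractional_error


# Entries read from the Sample 601 row of Table XI (assumed column meaning).
shift_601 = total_shift([18.0, 16.0, -17.0])
q_601 = implied_q(shift_601, 0.010)
print(shift_601, round(q_601))
```

The near cancellation of the +16 and −17 entries is the compensation
referred to in the text; the net shift is then dominated by the
remaining contribution.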

7.5 Confirmation of Impurity-Scattering Theory on Sample 601 

We have based our corrections for impurity scattering in the purer
samples on plausible theoretical reasoning and on favorable though not
ideally consistent evidence from isothermal electrical measurements. It
is therefore of some interest to see if the thermomagnetic measurements
on a sample with a sizable amount of impurity scattering differ from
those on the pure samples, in the way in which we would predict on the
basis of (23) and Figs. 8 to 10, assuming the energy dependences of
Π_p∥ and Π_p⊥ which we inferred from Fig. 11.

Fig. 12 shows the comparison just mentioned for Sample 601 at 94°K.
This sample was chosen for the comparison because good isothermal data
were available for it and because it had about the maximum amount of
impurity scattering for which the predictions of (23) are likely to be
reliable. (For some of the other samples, the electrical and
thermomagnetic data were at different temperatures, and for 596 at 87°
the divergence between the full and dashed curves of Fig. 7 is beginning
to be appreciable.) The dashed curves in Fig. 12 are the curves of Fig. 7,
and represent the mean moments of the total Π* − Π_e for the purer
samples. The plotted points represent the empirical moments for Sample
601, computed from the data of Table VI in the same way as for the pure
samples of Fig. 7. The small temperature-gradient correction just
discussed is, of course, included in the corrections of Table VI. These
points are to be compared with the full curves, which were constructed by
computing theoretically the differences between the moments for Sample
601 and the ideal moments, and combining them with the dashed curves. The
differences in question were computed by the following steps: (a) From
(25), the observed μ_H of 601, the empirical w_1 and w_2 of Table X, and
the assumed acoustic value μ_Ha(94°) = 28,200 cm²/vs (probably good to
within a few per cent), we computed the abscissa appropriate to Sample
601 in Figs. 8 to

Fig. 12 - Comparison of the empirical moments for Sample 601 (points)
with those of the purer samples of Fig. 7 (dashed curves) at 94°K. The full
curves are moments computed from the latter, with allowance for the amount of
impurity scattering present for Sample 601.


10. (b) From these figures and the assumed power laws for Π_p∥ and
Π_p⊥, the ratios of the moments of Π_p∥,⊥ for Sample 601 to the ideal
moments were obtained. (c) The same ratios were obtained for the average
of the pure specimens, assuming an abscissa of 0.97 in the figures. (This
is larger than corresponds to the above μ_Ha and the mobilities measured
on the unetched samples, but is more reasonable in the light of the 77°
data and the effect of etching at that temperature.) (d) From steps (b)
and (c) and the empirical moments of Π_p∥,⊥ of Fig. 11, the moments of
Π_p∥,⊥ to be expected for Sample 601 were calculated. (e) The theoretical
moments of Π* − Π_e were calculated for Sample 601 from Figs. 8 and 10,
and combined with the moments of Π_p∥,⊥ just obtained.
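
Steps (b) to (e) amount to rescaling the empirical pure-sample moments
and then adding back the electronic part. The sketch below is one
reading of that procedure, not the authors' actual computation: it
assumes that step (d) multiplies each empirical moment by the ratio
found in step (b) divided by the ratio found in step (c), and that the
combination in step (e) is a simple sum. The function names are
hypothetical, and all numbers are placeholders, since the values read
from Figs. 8 to 11 are not tabulated in the text.

```python
def expected_phonon_moment(pure_empirical, ratio_sample, ratio_pure):
    """Step (d), as assumed here: rescale the empirical pure-sample moment.

    ratio_sample: moment(sample)/moment(ideal) from step (b)
    ratio_pure:   moment(pure average)/moment(ideal) from step (c)
    """
    return pure_empirical * ratio_sample / ratio_pure


def predicted_total_moment(phonon_moment, electronic_moment):
    """Step (e), as assumed here: add the theoretical electronic moment."""
    return phonon_moment + electronic_moment


# Placeholder inputs (volts and dimensionless ratios), for illustration only.
pure_moment = 0.40
phonon_601 = expected_phonon_moment(pure_moment, ratio_sample=0.80, ratio_pure=0.97)
total_601 = predicted_total_moment(phonon_601, electronic_moment=0.05)
print(phonon_601, total_601)
```

Dividing by the step (c) ratio removes the small amount of impurity
scattering already present in the pure specimens before the larger
correction for the less pure sample is applied.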

It will be seen that, for the parallel component especially, the
empirical points fall much closer to the curve computed with allowance
for impurity scattering than to the curve for the pure samples. Since the
theoretical difference between the two curves is proportional to the rate
of variation of Π_p with energy and to the departures of the ordinates of
Figs. 8 to 10 from unity, the agreement is a significant bulwark for the
analysis of this section. However, it is of more qualitative than
quantitative significance, for the inputs to the calculation were
numerous and some of them a little uncertain. The discrepancy for the
perpendicular component may well be due to the great sensitivity of the
Π* − Π_e curve of Fig. 10 to the abscissa chosen.


VIII. FURTHER OBSERVATIONS SUBSTANTIATING AND EXTENDING THE
PRESENT MODEL


In this section we shall discuss several parts of our data which are not
adapted to as precise an analysis as that of Sections VI and VII, but
which can still be semiquantitatively explained and give indications of
how well our conclusions can be extended to higher energies and higher
temperatures.


8.1 The High-Field Asymptote of BH 


So far, we have made no use of our observations of the Nernst coefficient
at large magnetic fields. If the asymptotic high-field behavior were
accurately known, it could provide a valuable extension of the moment
analysis of Section VI. For, as (86) of Appendix B shows, this asymptotic
behavior involves the moments (20) for n = 0 and −1, so that
measurements in two orientations could be used to determine the n = −1
moments of Π_p∥ and Π_p⊥. In practice, however, the accuracy of such a
determination would be very questionable, as it would depend on the
already rather uncertain zeroth moments and on accurate knowledge of the
asymptotic BH, which, as we shall presently see, is rather uncertain for
various experimental reasons. We have therefore chosen merely to compare
the high-field behavior of BH with the asymptotic behavior predicted by
the special-case formulas of Table II, with the object of showing that
there is at least fair agreement with the model previously deduced.

As we have noted in Section V, the small value of the Nernst constant at
high fields makes it very sensitive to falsification by the Hall fields of
the longitudinal counter-currents which will be set up if surface
conduction partially short-circuits the thermoelectric field; a surface
conductance of a fraction of one per cent can be serious. We did not
realize this soon enough to carry out extensive Nernst measurements on
etched specimens. The lowest temperature at which such measurements were
made was 94°K. At this temperature the difference between the behavior of
Sample 606 etched and unetched was profound, the unetched having about
twice the Nernst field of the etched at 18 kilogauss. At higher
temperatures, the surface effect gets even worse, because the
counter-currents are driven by the full Q, including the term
proportional to the distance of the Fermi level below the band edge,
whereas the Nernst effect arises only from Q_p and the small transport
term in Q_e.


Fig. 13 - Comparison of the asymptotic behavior of the Nernst field constant,
BH, at high fields with the predicted (B_e + B_p)H of the simplified asymptotic
formulas of Table II. The three experimental curves are for two samples of
different orientations; note that the two scales are different. The short-dashed
curves are computed from the formulas of Table III for Π_p independent of energy,
the long-dashed curves for Π_p ∝ ε^(-1/2). For both sets, the values w = 0.061,
p = 9.8 were assumed. The unit of BH is volts/degree; the abscissa is the
magnetic field.


Fig. 13 shows a comparison of three observed curves of BH against H
with the leading (H^(-1)) term in the asymptotic expression calculable
from Table II. The data for Sample 606 at 94°K are the most reliable,
since this sample has been etched; however, these data do not extend to
as high a value of μH/c as is obtainable at lower temperatures, so there
may be some doubt as to how well they approach asymptotic behavior
at 18 kilogauss. The data for Sample 606 at 60°, though on an unetched
specimen, appear not to have been very seriously falsified by surface
conduction, since they match the 94° data pretty well, as far as they
go, if ordinates are scaled proportionally to Q_p and abscissas to μ,
and if a little allowance is made for the electronic contribution.
However, at the high-field end, the 60° data lie closer to the
theoretical asymptote for Π_p ∝ ε^(-1/2) than to that for Π_p independent
of energy, whereas the reverse is the case for the 94° data; this may be
due to a slight raising of the experimental curve by surface conduction.
The data for Sample 603, with H in a [110] direction, show the
vastly slower saturation which is to be expected from the fact that, for
this orientation, half the valleys have a very large cyclotron mass. Since
the curves for the two samples are plotted on different scales, the values
of BH at 18 kilogauss and 60°-61°K actually differ by a factor of 14 in
the two orientations, roughly the factor predicted by the theory. As
nearly as one can estimate from the range available, the experimental
curve seems to be heading for an asymptote closer to that for Π_p
independent of energy than to that for Π_p ∝ ε^(-1/2), perhaps even on
the other side of it. This is a slight discrepancy with theory which
might again be due to the presence of a little surface conduction.

Thus, it appears that the qualitative features of the high-field Nernst
curves are accounted for by the theory. The curves also give some
information on the value of the anisotropy p = Π_p∥/Π_p⊥. Since the
entries in the next to the last column of Table II are almost proportional
to (p − 1), it is probably safe to say that the present curves indicate
a p within 20 per cent or so of the value derived in the last section.
Regarding the variation of the weighted average of Π_p∥ and Π_p⊥ with
energy, about all that can be said is that this variation again appears not


1 
; 


to be as rapid a decrease as e€ 


8.2 Reversal of the Sign of B with H 


All our analysis so far has been based on data at temperatures in the
range 60° to 131°K. No comparably accurate analysis can be made
at higher temperatures, partly because the limits of the various
quantities as H → ∞ can no longer be determined, and partly because the
value of Q_p is much less accurately known than in the lower range. What
evidence there is, however, suggests that, at temperatures in the range
150° to 235°K, the anisotropy and energy dependence of Π_p are similar
to those which we have found in the liquid-air range. This evidence is
provided partly by the magnitudes of the low-field B and ΔQ/H², which
we shall discuss presently, and partly by an interesting reversal of the
sign of B with increasing magnetic field. The latter effect occurs at
temperatures at which the negative B_e(0) and the positive B_p(0) almost
cancel each other. This cancellation occurs near 175°K, B(0) being
greater than zero below this temperature, less than zero above it
(although easily falsified by surface conduction). Fig. 14 shows some
curves of B versus H at temperatures in this range, taken on etched
samples. The data previously reported on 606 and 604 unetched showed no
region of negative B(0), but now both show sign reversals at 171° to 174°.
What we wish to discuss now, however, is the fact that, as H increases,





THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1959 





[Fig. 14 plot. Legend: Sample 604, ∇T [100], H [011]; Sample 606, ∇T [100],
H [001]. Curves labeled 163°K, 176°K, 180°K and 205°K; abscissa H² in gauss².]
Fig. 14 Variation of the Nernst coefficient B with magnetic field H, for two 
orientations, showing the sign reversal which takes place at temperatures near 
where B, and B, compensate each other. For comparison, the computed B,(178°) 
is 1.0 * 10°-°v/deg gauss 


the B of Sample 604 goes from negative to positive, and that of 606
from positive to negative.

Although we have not elaborated the theory of B at intermediate
fields, it is doubtless safe to assume that, for all field strengths,
B_p > 0 and B_e < 0. We can get a fair idea how the ratio of B_p to B_e
varies with H by comparing the low- and high-field limits of |B_p/B_e| as
given by the formulas of Tables I and II. Table XII gives the comparison,
for several sets of assumptions about the anisotropy and energy
dependence of Π_p. We have constructed schematic curves of B_p and B_e
against H, shown in Fig. 15, from this information and the knowledge that
B_p decreases monotonically with H for H ∥ [001], while it has an initial

TABLE XII


Comparison of high- and low-field ratios of B_p (positive) to B_e
(negative), for several orientations and for different assumptions
regarding the anisotropy p = Π_p∥/Π_p⊥ and the dependence of Π_p on
energy ε. The numerical values given are for w = m*_⊥τ_∥/m*_∥τ_⊥ = 0.06.


By(#) 


Direction of Ratio of B.(a) 


II, independent of €, any p 


(001 | LH 
O11} {100} 


{O11) [O11] 


rise for H ∥ [011], if p = Π_p∥/Π_p⊥ is large. From these curves it is
clear that, if the anisotropy and energy dependence of Π_p are similar at
the temperatures of Fig. 14 to what we found in Fig. 11, then B should
become more negative with increasing H for Sample 606, more positive for
604. This is exactly what is observed.

Only the crudest sort of quantitative limits can be derived, from this
comparison, for p and the rate of variation of Π_p with energy. From an
inspection of Table XII, Fig. 15 and the curves of Fig. 9 of Ref. 1, it
seems clear that the sizes of the shifts of B with field in Fig. 14 are
eminently consistent with the previously found anisotropy and energy
dependence, and that they would be hard to reconcile with a Π_p that
increased with increasing energy, or with a p close to unity. Specifically,
Table XII suggests that the theoretical curve for 606 is nearly
independent of p, so that the downward slopes in Fig. 14 must arise from an
inverse energy dependence of Π_p. The swing in B between zero and
5000 gauss is of the order of a twentieth of |B_p(0)|; this would not be
unreasonable for Π_p ∝ ε^(-1/2). The swing for 603, on the other hand, is
probably due to the fact that B_p rises with H while B_e falls; if so, it
should be of the order of one or two times the rise in B_p, a rise due
mainly to the anisotropy of Π_p (p > 1). For the observed anisotropy at
liquid-air temperature (p ~ 10) this rise (see Fig. 9 of Ref. 1), at the
μH/c corresponding to 5000 gauss at 178°, is perhaps 15 per cent of
B_p(0). The observed swing is about twice this.

Fig. 15 - Schematic variation of B_e and B_p with magnetic field, for two
orientations, assuming the temperature such that B_e(0) = −B_p(0). Two B_p
curves are shown for each case, corresponding to Π_p ∝ ε^0 and ε^(-1/2),
respectively. It is assumed that Π_p∥/Π_p⊥ > 1.


8.3 Analysis of High-Temperature Data 


The sort of moment analysis which we made in Figs. 7 and 11 gets
increasingly difficult as T increases, for several reasons: high-field limits
become unavailable; the phonon-drag effects become a much smaller
part of the total; because of this, it is easier for such things as surface
inversion layers to falsify measurements of bulk thermomagnetic effects.
We shall nevertheless try to draw what inferences we can from the
temperature at which B(0) changes sign, and from the totality of
measurements on Sample 601 at 234°K, a sample pure enough to be fairly
free of impurity scattering at this temperature, yet sufficiently highly
conducting to be insensitive to surface conduction.

As Fig. 14 shows, B(0) reverses sign in the pure, etched specimens at
174°K (603) or 172°K (606). From the average of these and interpolations
of the Q_p values of Table VI we can deduce a value for the ζ_p of
(13). With the acoustic-scattering value of B_e, we obtain ζ_p = 0.226,
a value close to the average ζ_p of 0.229 found for the pure specimens at
77°K, corrected for the effect of impurity scattering on B_e. This
suggests either that the anisotropy and energy variation of Π_p are very
similar at 173° and 77° or that the difference in anisotropy just happens
to be such as to compensate for the difference in energy dependence. The
131°
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Fig. 16 — Results of an attempt to determine empirically the moments II,),12 
defined by (20) for Sample 601 at 234°AK. The points O, + and X correspond re 
spectively to three choices of the assumed II,1"); it will be seen that the O choice 
gives the most nearly plausible behavior for IT,\“" 


data of Fig. 11 suggested a possible decrease of the anisotropy with 
increasing temperature, and a possible increase in the rapidity of varia- 
tion of II,, with energy. These factors would affect ¢, in opposite ways, 
so they could cancel. 

We have attempted a moment analysis like that of Sections VI and 
VII for Sample 601 at 234°K. We adopted the plausible anisotropy 
WwW, = We Ws 0.056, (see Table X and the discussion of Section VI, 
especially the remarks about change of w with temperature). This is a 
value which happens to fit the anisotropy of the magnetoresistance. The 
data give II* — Il, 0.0507 volt, II* — Hy” 0.298 volt. The only 
additional knowledge obtainable, however, is one relation between II 
and I1,"” (to fit Q) and one relation between II)” and Il,” (to fit B). 
If, as before, we assume II,‘"’ to be linear in n, we can adjust 1," until 
the curve of Il," against n is reasonably smooth. Fig. 16 shows the 
results for II,),.°"° after subtracting off the values of II* — II,", slightly 


corrected for impurity scattering. Several choices of II,,° have been 


made, and their consequences are shown in the figure. There are strong 
suggestions that I1,)‘"/I,,‘” is still smaller than at 131°K, that II, 
varies more rapidly with energy than at 77° and that II,, continues to 
vary slowly. However, the data are so limited that only the last of these 
points can be considered reasonably established. 


IX. SUMMARY AND IMPLICATIONS OF THE RESULTS 


9.1 Evidence on Π_p∥(ε) and Π_p⊥(ε)


We have concluded that, near liquid-air temperature, the dependence 
of the phonon-drag Peltier tensor of an energy shell on the energy e of 
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0.08 


? 


. . ).3 
the shell is given, at least roughly, by Il,)(€) « e Hyste) -o.« 


while the first-moment ratio ⟨εΠ_p∥⟩/⟨εΠ_p⊥⟩ is about 9.6. These
conclusions were based primarily on the analysis of high- and low-field ΔQ
data and low-field Nernst data on the several purest samples, with
auxiliary use of mobility and magnetoresistance data. The use made of the
low-field data depended to some extent on corrections for impurity
scattering, the method of making the corrections having been roughly
checked both against purely electrical measurements and against
alternative theoretical assumptions. The data for Sample 601, which showed
a sizable effect of impurity scattering, were consistent with the assumed
characteristics of Π_p(ε) combined with the assumed impurity-scattering
law. The conclusions regarding Π_p⊥(ε) were very insensitive to the more
uncertain of the inputs and were consistent from sample to sample and
from one temperature to another. Those regarding Π_p∥(ε) were more
sensitive and showed more fluctuation, but always showed roughly the
same behavior. High-field Nernst data, though less suited to accurate
analysis, independently indicated the ratio of Π_p∥ to Π_p⊥ to be of the
order of 10, and also indicated a decrease of the weighted average of these
quantities with ¢, slower than ) 

The high-field limit of the transverse AQ’s gave strong but not entirely 
certain indications that the decrease of II,) with increasing energy be- 


comes more rapid than the above-mentioned ¢« °° at high energies. The 
variation of the anisotropy and energy dependence of Π_p with
temperature was too small to be detected over the range 60° to 94°K.
However, we found some rather uncertain indications that Π_p∥/Π_p⊥
decreases slowly with increasing T, and that the energy dependence of
Π_p∥ becomes steeper. These conclusions came from a comparison of 131°
thermomagnetic data with data at lower temperatures, and from studies
of low-field thermomagnetic data at 234°. The low- and intermediate-field
behavior of the Nernst coefficient near 175°K was found to be consistent
with the anisotropy and energy dependences found from the previous
sources, but did not suffice to check the temperature variation of these
two factors individually. The data at all temperatures are clearly
inconsistent with a law Π_p ∝ ε^(1/2), such as would result (see below) if
the relaxation times of the phonons were independent of frequency.


9.2 Implications for Phonon-Phonon Scattering 


The energy dependence of Π_p(ε) reflects the dependence of the
relaxation times τ_ph of the different phonon modes on their frequency ω.
For example, if all branches of the phonon spectrum had

    τ_ph ∝ ω^ζ × (function of direction),                    (27)

then, since the wave numbers, or frequencies, of the modes coupled to
electrons of energy ε scale proportionally to ε^(1/2), we would have, as
ε varies,

    Π_p ∝ ε^(1/2) × (a mean τ_ph).                           (28)

If the longitudinal and transverse branches were to obey (27)
individually, but with different exponents ζ_l and ζ_t, then the effective
energy exponents for Π_p∥ and Π_p⊥ would be intermediate between
1/2 + ζ_l/2 and 1/2 + ζ_t/2, and the exponents would in general be
different for Π_p∥ and Π_p⊥.
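
The exponent bookkeeping implied by (27) and (28) can be sketched as
follows. This is a minimal illustration, assuming that (27) reads
τ_ph ∝ ω^ζ, that the coupled phonon frequency scales as ε^(1/2), and that
(28) carries an explicit factor ε^(1/2), so that Π_p ∝ ε^(1/2 + ζ/2). The
function names are hypothetical, and the input exponents are illustrative
values, not fitted results.

```python
def energy_exponent(zeta):
    """Exponent a in Pi_p ~ eps**a, given tau_ph ~ omega**zeta (assumed forms)."""
    return 0.5 + 0.5 * zeta


def relaxation_exponent(a):
    """Inverse map: exponent zeta in tau_ph ~ omega**zeta, given Pi_p ~ eps**a."""
    return 2.0 * a - 1.0


# Illustrative cases: a frequency-independent relaxation time (zeta = 0),
# and two inverse power laws (zeta = -1 and zeta = -2).
for zeta in (0.0, -1.0, -2.0):
    print(zeta, energy_exponent(zeta))

# A relaxation-time exponent of -1.6 corresponds to an energy exponent
# near -0.3 under these assumptions.
print(energy_exponent(-1.6), relaxation_exponent(-0.3))
```

Under these assumptions a frequency-independent relaxation time gives a
Π_p that rises with energy, while ζ = −1 gives a Π_p independent of
energy and ζ = −2 gives Π_p ∝ ε^(-1/2), the two comparison cases used in
the high-field figures.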

Our liquid-air-temperature result is thus that the phonons interacting
with a current parallel to the valley axis have an average relaxation time
τ_ph ∝ ω^(-1.6), while those interacting with a current in the perpendicular


direction have an average tp, & w . These are to be compared with 


the "ideal" phonon-phonon scattering laws


longitudinal branch: 29 


transverse branch: fi. te. 30) 


I 


valid if ω < the frequency of the average thermal phonon and if only
three-phonon noncollinear collisions contribute to the relaxation of the
modes concerned. Estimates of the frequency at which appreciable
departures from (29) or (30) set in have suggested that, for germanium at
liquid-air temperature, these laws should be approximately obeyed, but
that departures from them may well be appreciable. As it can be shown
that τ_ph goes as a higher power of 1/ω when ω gets large, it seems likely
that the finiteness of ω will lead to higher, rather than lower, negative
exponents. Thus, at present a more plausible explanation of the observed
exponents is that the longitudinal and transverse branches contribute
comparably to the phonon-drag phenomenon. Since it is known that the
longitudinal and transverse branches contribute comparable amounts to
the scattering of the electrons, this would require that, in the frequency
range most important for the scattering of electrons, the relaxation times
of the transverse modes be comparable with or rather larger than those
of the longitudinal modes. This is contrary to what one would anticipate
from (29) and (30), but cannot yet be considered unreasonable, in view
of our present ignorance about scattering matrix elements, etc.

There are two further possibilities which would lead to departures
from the asymptotic laws (29) and (30). One of these is three-phonon
collisions in which the three phonons belong to the same branch and have
almost collinear wave vectors. If such collisions are possible, they will
change (29) into the same form as (30). For this to occur for a
low-frequency longitudinal mode, it is necessary that some higher-frequency
mode with the same direction of propagation have a higher group
velocity than a mode of infinitesimal frequency. This could happen, for
example, if the curve of frequency against wave number in the
longitudinal branch curved upward instead of downward. Although this does
not seem impossible a priori, no real crystal or mathematical model has
to our knowledge been found with this property. Moreover, by now our
knowledge of the elastic spectrum of germanium is good enough to
make it fairly certain that this does not occur for the present material.
More intriguing is the possibility of higher-order processes, i.e.,
collisions involving four or more phonons, or possibly even processes more
appropriately describable as relaxation effects. From the standpoint of
collision theory, a calculation of, say, three-phonon collisions becomes
invalid when the frequency of one of the modes involved becomes less
than the natural widths τ_ph^(-1) of the other modes, or when the failure
of energy conservation in an "energetically impossible" process gets less
than some relevant ħτ_ph^(-1). Moreover, even when neither of these
catastrophes occurs, and the three-phonon process that is being considered
is perfectly meaningful, it may happen that the number of such possible
three-phonon processes is so small that the relaxation of the
low-frequency mode involved occurs predominantly by higher-order processes.
However, the rates of four-phonon processes vary much more rapidly
with temperature than do those of three-phonon processes. At the
frequencies of the phonon-drag modes, which are high compared to the
reciprocal thermal relaxation time of the majority modes, relaxation can
be described as due to some, though not all, of the collisions involving
four or more phonons. Since the temperature dependence of Q_p in the
range 60° to 90°K is not far from that to be expected for three-phonon
processes, we must conclude that higher-order processes are at most a
small correction in this range. But if they are at all appreciable at one
temperature, they must become completely dominant at a higher
temperature. The changeover from dominance of three-phonon processes to
dominance of higher-order processes would almost certainly change the
anisotropy or the energy dependence of Π_p(ε). In particular, if the
relaxation time of the phonon-drag modes became independent of frequency
(as it would for the simplest type of higher-order processes), Π_p would
go as ε^(1/2), in violent disagreement with our observations at all
temperatures. The absence of any abrupt change of the energy dependence or
anisotropy of Π_p with temperature thus shows that, for the phonon-drag
modes, three-phonon processes are more important than higher-order
processes, up to at least 234°K, and that over most of this range the
dominance of the former must be extreme.

It is not possible to follow the phonon-phonon scattering to tempera- 
tures much below 60°K without a theory that takes account of boundary 
scattering, an effect which spoils the crystalline symmetry of the
problem. In fact, some of the apparent changes which Fig. 11 shows in the
behavior of $\Pi_p$ at 60° may well be due to the fact that boundary
scattering is beginning to be too large to be corrected for by the crude
method of Section V.


9.3 Transport Theory and Deformation-Potential Theory 


Although the study of the purely electrical properties of n-type ger- 
manium was only an incidental feature of the present work, our results 
have led to a number of new conclusions in this field, which we shall 
summarize here. 

To begin with, we have found considerable evidence (presented in
Appendix A and summarized in Section II) that in a magnetic field one
need not abandon conventional transport theory in crystal-momentum
space in favor of a quantum treatment until the cyclotron spacing
$\hbar\omega_c$ has become close to $kT$. Specifically, we found that, at
$\hbar\omega_c/kT \approx 1/4$ and for acoustic scattering, the
longitudinal and transverse resistivities are correctly given by
conventional theory to within 5 per cent or so, and in most cases probably
rather closer than this. Even at $\hbar\omega_c/kT \approx 1$ our
$\Delta Q$ evidence suggests, again for predominantly acoustic scattering,
that the errors may be only of the order of 10 per cent or so.

We have found the magnetoresistance and Hall data at high and low
magnetic fields to be fairly consistent with the predictions of the
electron-group theory, i.e., with the picture which approximates the effect
of the scattering processes by relaxation times $\tau_\parallel(\epsilon)$
and $\tau_\perp(\epsilon)$. For the most accurately measured cases (etched
samples at 77.4°K), the accuracy of the fit can be described as follows:
The single parameter
$w = m_\perp^*\langle\epsilon\tau_\parallel\rangle / m_\parallel^*\langle\epsilon\tau_\perp\rangle$
can be chosen to reproduce the longitudinal high-field magnetoresistance
limits $\rho(\infty)/\rho(0)$ for the four purest samples in [100], [110]
and [111] orientations to within the reproducibility of measurements from
sample to sample (2 per cent or so). The values of $\rho(\infty)/\rho(0)$
for three transverse orientations also agree with predictions based on
this anisotropy and the assumption
$\tau_{\parallel,\perp} \propto \epsilon^{-1/2}$, to within a somewhat
larger uncertainty (worst discrepancy 13 per cent), due in part to surface
conduction, etc., and in part to quantization. The low-field
magnetoresistance constants obey the symmetry relation characteristic of
the electron-group approximation (the relations at the bottom of Table IV
with $\Delta\rho$ substituted for $\Delta Q_p$) about as accurately as they
obey the requirements of macroscopic symmetry (consistency to a few per
cent).

Concerning the energy dependence of $\tau_\parallel$ and $\tau_\perp$, the
most accurate evidence we have is that from the ratio of low- to high-field
Hall constant, i.e., the ratio of Hall to drift mobility. As was shown in
Section VII and Fig. 8, this ratio, when measured on etched samples, gets
closer and closer to the value predicted for
$\tau_{\parallel,\perp} \propto \epsilon^{-1/2}$ as the importance of
impurity scattering is reduced. We were able to conclude that only a very
small amount of optical-mode or intervalley scattering, or of any other
departure from the law $\tau_{\parallel,\perp} \propto \epsilon^{-1/2}$,
can be present at temperatures in the range 77° to 131°K.

In the calculation of the absolute value of the acoustic mobility,
$\mu_a$, of n-type germanium from deformation-potential theory, two
deformation-potential constants are required. One of these can be obtained
with some accuracy from piezoresistance measurements, and the other can be
inferred if magnetoresistance or other data give a reliable value of the
acoustic-scattering anisotropy $\tau_\parallel/\tau_\perp$. Our present
measurements have yielded much better values, both for $\mu_a$ and for
$w = m_\perp^*\tau_\parallel/m_\parallel^*\tau_\perp$, than were available
when Ref. 3 was written, so it is of interest to see how well the
deformation-potential calculation of $\mu_a$ now works out.

The first step is to obtain $\tau_\parallel/\tau_\perp$ from the cyclotron
masses $m_\perp^* = 0.082m$, $m_\parallel^* = 1.58m$ (assumed for the
moment to be valid near liquid-air temperature) and the high-purity limit
$w = 0.061$, presumably characteristic of acoustic scattering. The result
is $\tau_\parallel/\tau_\perp = 1.18$. From this and Equations (52) and
(53) or Fig. 5 of Ref. 3 (also based on the masses mentioned), we get the
value $-0.40$ for the ratio $\Xi_d/\Xi_u$ of the deformation-potential
constant for stretch along a direction normal to the valley axis to that
for the combination of a compression along such a direction with an equal
stretch along the valley axis. From this value of $\Xi_d/\Xi_u$ and the
piezoresistance constant

30,400/T

we obtain $\Xi_d$, $\Xi_u$, and thence the acoustic mobility. The value
predicted in this way for 77.4°K is

$$\mu_a = 45{,}100\ \mathrm{cm^2/vs}. \qquad (32)$$
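
The anisotropy quoted above follows directly from the definition of w. As a minimal sketch of that arithmetic, assuming $w = m_\perp^*\tau_\parallel/(m_\parallel^*\tau_\perp)$ and using the masses and high-purity limit given in the text (the function name is illustrative only):

```python
def relaxation_time_anisotropy(w, m_perp, m_par):
    """Return tau_par/tau_perp from w = (m_perp * tau_par) / (m_par * tau_perp).

    Masses are in units of the free-electron mass; only their ratio matters.
    """
    return w * m_par / m_perp


# Values quoted in the text: cyclotron masses and the high-purity limit of w.
ratio = relaxation_time_anisotropy(w=0.061, m_perp=0.082, m_par=1.58)
print(f"tau_par/tau_perp = {ratio:.2f}")  # prints 1.18
```

The result reproduces the value 1.18 stated in the text.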


The empirical value is obtained by dividing the ideal value
$\mu_{Ha}/\mu_a = 0.933$ into the observed Hall mobility, after correction
for impurity scattering by the estimates described in Section VII. With
the value 38,600 cm²/vs there adopted for $\mu_{Ha}$ (see Fig. 8), we get

$$\mu_a(\text{empirical}) = 41{,}400\ \mathrm{cm^2/vs}. \qquad (33)$$
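
The empirical figure is a single division, and its comparison with a predicted value is a relative difference. A small sketch of both steps, using the Hall mobility and ideal ratio quoted in the text; the helper names are illustrative, and the predicted mobility is left as an argument rather than hard-coded:

```python
def drift_from_hall(mu_hall, hall_to_drift_ratio):
    """Convert a Hall mobility to a drift mobility by dividing by mu_H/mu."""
    return mu_hall / hall_to_drift_ratio


def percent_excess(predicted, empirical):
    """Percentage by which the predicted value exceeds the empirical one."""
    return 100.0 * (predicted - empirical) / empirical


# Acoustic Hall mobility adopted in Section VII and the ideal ratio 0.933.
mu_a_empirical = drift_from_hall(38_600.0, 0.933)
print(f"empirical acoustic mobility ~ {mu_a_empirical:,.0f} cm^2/vs")
```

Calling `percent_excess` with the deformation-potential prediction and `mu_a_empirical` gives the discrepancy discussed in the next paragraph.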


The agreement between (32) and (33) is gratifying. Although the 8 
or 9 per cent discrepancy between them is larger than the uncertainty 
in either due to uncertainties in the experimental inputs (perhaps 2 or 
3 per cent), it is of the right sort to be attributable to departures of the 
masses or deformation-potential constants at 77° from the low-tempera- 
ture values we have assumed; moreover, the inherent inaccuracy of the 
deformation-potential method could conceivably be of this same order. 


As regards the former point, we may notice that the difference between
mobilities varying as $T^{-1.50}$ and as $T^{-1.65}$ amounts to 8 per cent
between 46° and 77°K. If the latter variation, which we found to be closer
to the truth over the range 60° to 94°K, were due to something like a
temperature variation of the masses which disappeared below 40° or so, the
discrepancy would be eliminated.
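
The size of this effect is easy to check. If two mobility laws $T^{-a}$ and $T^{-b}$ are matched at a lower temperature, their ratio at a higher temperature is $(T_{high}/T_{low})^{b-a}$. The sketch below (helper names are illustrative, not taken from the paper) evaluates that ratio and also solves for the lower temperature that gives a stated fractional difference at 77°K:

```python
def power_law_divergence(t_low, t_high, exp_a, exp_b):
    """Ratio mu_a/mu_b at t_high for mu_a ~ T**-exp_a and mu_b ~ T**-exp_b,
    with the two laws taken equal at t_low."""
    return (t_high / t_low) ** (exp_b - exp_a)


def matching_temperature(t_high, fractional_difference, exp_a, exp_b):
    """Lower temperature at which the laws must agree so that they differ
    by the given fraction at t_high."""
    return t_high / (1.0 + fractional_difference) ** (1.0 / (exp_b - exp_a))


# 8 per cent difference at 77 K between the T^-1.50 and T^-1.65 laws.
t_match = matching_temperature(77.0, 0.08, 1.50, 1.65)
print(f"laws agree near {t_match:.0f} K")
print(f"ratio at 77 K: {power_law_divergence(t_match, 77.0, 1.50, 1.65):.3f}")
```

Because the exponent difference is only 0.15, a modest percentage gap corresponds to a fairly wide temperature interval.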

Regarding impurity scattering, finally, we have found in Section VII
that it is possible to correlate with each other the effects of ionized
impurity scattering on mobility, Hall-to-drift-mobility ratio,
magnetoresistance, etc., at least in the range where impurity scattering is
small to moderate. In this correlation the anisotropy of the impurity
scattering was treated as an empirical parameter; beyond this, no specific
assumption was necessary about the mechanism of impurity scattering, except
that it affected low-energy carriers much more than high-energy ones.
The correlations we were able to make in Fig. 8 and Fig. 12 seem to be
quantitatively significant, but still leave something to be desired.

All these items deserve more careful study. Our Table XV (Appendix B),
incidentally, which gives Maxwellian averages of powers of the energy, may
prove handy for various types of calculations.


9.4 Surface Effects 


We have encountered serious effects of surface conduction in a number
of places in the present study. Our experience justifies the statement that,
at all temperatures, one must eliminate or allow for surface effects before
one can get really accurate measurements of any bulk electrical property
of high-purity material. For some properties, even the qualitative
behavior can be completely falsified by surface effects. Specifically, we
have found that a surface contribution of less than one per cent to the
conductance can greatly affect the apparent high-field transverse
magnetoresistance and the high-field Nernst coefficient near liquid-air
temperature and below, and that it can change the sign of the low-field
Nernst coefficient in the range 175° to 250°K. Near liquid-air temperature,
changes in surface treatment can alter the apparent conductivity and
Hall coefficient by amounts ranging up to several per cent.

726 THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1959

APPENDIX A. ORBITAL QUANTIZATION


We give here the reasoning behind a number of statements which one 
can make, including those made in the text. We shall take up first some 
quantitative statements about the relation of the magnetoresistance of a 
multivalley semiconductor to that of the simple model assumed in the 
existing theoretical literature (items 2 and 3 below). Next we shall pre- 
sent empirical and theoretical evidence on the smallness of the effect of 
quantization on longitudinal magnetoresistance (item 4). Finally, in 
items 5, 6 and 7, we shall discuss the relation of the effect of quantization 
on $\Delta Q$ to that on magnetoresistance, and shall present evidence that
these effects are fairly small even for transverse orientations. As the 
basis of the whole treatment we shall take the fairly obvious statement: 

1. For the region $\mu H/c > 1$, the orbital-quantization correction
functions for $\rho(H)/\rho(0)$ and $Q_p(H)/Q_p(0)$ are practically the
same functions of H as they would be if the electron-phonon coupling were
allowed to approach zero.

In other words, the difference between the true $\rho(H)/\rho(0)$ or
$Q_p(H)/Q_p(0)$ and the value calculated without quantization is a function
of the cyclotron spacing $\hbar\omega_c$ (primarily of
$\hbar\omega_c/kT$), which depends, of course, on the anisotropy of the
electron-phonon interaction, but which depends on the magnitude of the
latter (i.e., on $\mu H/c$) in such a way as to approach a limit uniformly
as the scattering goes to zero ($\mu H/c \to \infty$). This is, of course,
what one expects intuitively; it follows from any transport theory in
which $\rho(0)$, $\rho(H)$, etc. are representable by expansions in
powers of the strength of the scattering.

We shall show first that, if the scattering processes are energy-conserv- 
ing collisions with a squared matrix element that is the same for all 
transitions within a valley, the following statement holds: 

2. In the approximation of isotropic scattering over an energy shell
(i.e., $\tau_\parallel = \tau_\perp$), the longitudinal magnetoresistance
ratio $\rho(H)/\rho(0)$ for any multivalley case equals the product of the
value calculated in the absence of quantization by a weighted average of
the values calculated for isotropic m*'s equal to the cyclotron masses of
the various valleys for the given direction of H, provided $\mu H/c >$ a
value which for germanium is of the order of a fraction of
$m_\parallel^*/m_\perp^*$.

Thus, for longitudinal magnetoresistance, the orbital-quantization
correction can be deduced from the simple-model calculations in the
literature (Refs. 5, 6, 7, 8).

For the proof we note first that when $E \parallel H$ in, say, the z
direction, $\rho_{zz} = 1/\sigma_{zz}$; this is exact for all H in symmetry
directions, and asymptotically exact as $H \to \infty$ in any direction.
Therefore we need consider only $\sigma_{zz}$, which is a sum of
contributions $\sigma_{zz}^{(i)}$ from the different valleys i. For any
such valley we can reduce the transport problem to that for the simple
model by means of a transformation which projects the energy surfaces into
spheres. In considering this transformation we must remember that it is
not an orthogonal one, and that it therefore carries with it a need to
distinguish between covariant and contravariant vectors. For covariant
vectors, F, the transformed vector, $\hat F$, is related to the original
vector F by

$$\hat F_\parallel = r^{1/2} F_\parallel, \qquad \hat F_\perp = F_\perp, \qquad (34)$$

where $\parallel$ and $\perp$ refer to the valley axis and
$r = m_\perp^*/m_\parallel^*$. For contravariant vectors G the relation is

$$\hat G_\parallel = r^{-1/2} G_\parallel, \qquad \hat G_\perp = G_\perp. \qquad (35)$$

The scalar product of a covariant and a contravariant vector is
unchanged by the transformation. It is easily verified that, if we regard E,
the crystal momentum P, and the vector-potential A as covariant vectors,
while regarding velocity (or j) and H as contravariant vectors, all
the usual relations involved in the Schrödinger equation and the transport
equation are the same for the transformed and original vectors,
except that in the transformed space the effective-mass tensor becomes
isotropic.
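
These statements can be checked numerically. The sketch below is only an illustration: it assumes the valley axis lies along z and that the scale factors along it are $r^{1/2}$ for covariant and $r^{-1/2}$ for contravariant components, with $r = m_\perp^*/m_\parallel^*$. The function names are not from the paper.

```python
import math

M_PERP = 0.082  # m_perp*/m, as quoted in Section 9.3
M_PAR = 1.58    # m_par*/m, as quoted in Section 9.3
R = M_PERP / M_PAR


def to_covariant_hat(f, r=R):
    """Transform a covariant vector (E, P, A); valley axis taken along z."""
    fx, fy, fz = f
    return (fx, fy, math.sqrt(r) * fz)


def to_contravariant_hat(g, r=R):
    """Transform a contravariant vector (velocity, j, H); valley axis along z."""
    gx, gy, gz = g
    return (gx, gy, gz / math.sqrt(r))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def energy(p):
    """Ellipsoidal energy surface P_perp^2/(2 m_perp) + P_par^2/(2 m_par)."""
    px, py, pz = p
    return (px**2 + py**2) / (2 * M_PERP) + pz**2 / (2 * M_PAR)


E = (0.3, -1.2, 0.7)  # arbitrary covariant vector
j = (2.0, 0.5, -1.1)  # arbitrary contravariant vector

# The scalar product is unchanged by the transformation.
assert math.isclose(dot(E, j), dot(to_covariant_hat(E), to_contravariant_hat(j)))

# In the transformed space the energy depends only on |P_hat|, with mass m_perp.
P = (0.4, 0.1, 0.9)
P_hat = to_covariant_hat(P)
assert math.isclose(energy(P), dot(P_hat, P_hat) / (2 * M_PERP))
```

With this choice of scale factors the sphericalized valley carries the isotropic mass $m_\perp^*$.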


Now let $\hat\sigma_\parallel(H)$ be the conductivity in the transformed
system when $\hat E \parallel \hat H$, and $\hat\sigma_\perp(H)$ be that
for $\hat E \perp \hat H$. These conductivities will be independent of the
orientation of $\hat H$ or of the azimuth of $\hat E$ about $\hat H$, if
the scattering for $H = 0$ is isotropic over each energy shell, as is known
to be very nearly the case for acoustic scattering in n-type germanium, and
if at the same time any effects associated with finiteness of phonon
energies are also isotropic. Since it is known that the latter effects are
important for incipient quantization effects in longitudinal
magnetoresistance, this last assumption deserves further investigation;
however, we shall make it and proceed. Then, for the conductivity along H
of all the valleys i in parallel, we have


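$$\sigma_{zz}(H) = \sum_i \left[\lambda_\parallel^{(i)}\,\hat\sigma_\parallel(\hat H^{(i)}) + \lambda_\perp^{(i)}\,\hat\sigma_\perp(\hat H^{(i)})\right], \qquad (36)$$

where $\hat H^{(i)}$ denotes the transformed field in valley i and the
coefficients $\lambda_\parallel^{(i)}$, $\lambda_\perp^{(i)}$ depend only
on the orientation of H with respect to the axis of valley i. Therefore,
since $\hat\sigma_\parallel = \hat\sigma_\perp = \hat\sigma_0$ at $H = 0$,

$$\frac{\sigma_{zz}(H)}{\sigma_{zz}(0)} = \frac{\sum_i \lambda_\parallel^{(i)}\,\hat\sigma_\parallel(\hat H^{(i)})}{\hat\sigma_0 \sum_i \left(\lambda_\parallel^{(i)} + \lambda_\perp^{(i)}\right)} + \frac{\sum_i \lambda_\perp^{(i)}\,\hat\sigma_\perp(\hat H^{(i)})}{\hat\sigma_0 \sum_i \left(\lambda_\parallel^{(i)} + \lambda_\perp^{(i)}\right)}. \qquad (37)$$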


Now, in the absence of quantization, $\hat\sigma_\perp$ is of order
$H^{-2}$ or less as $H \to \infty$, while $\hat\sigma_\parallel$ remains
finite; when quantization is just starting to be important and
$\mu H/c > 1$, $\hat\sigma_\perp$ will still be $< \hat\sigma_\parallel$.
Under our assumptions, the dependence of both these quantities on H is
given by the theory for the simple model with isotropic effective mass. If
we keep only the first term of (37), we have the result italicized above.
For some orientations, however, the coefficient involving $\lambda$'s in
the second term of (37) is larger than that in the first term by a factor
of the order of a fraction of $m_\parallel^*/m_\perp^*$. This explains the
qualification in the statement of the theorem.

3. In the approximation of isotropic scattering over an energy shell
(i.e., $\tau_\parallel = \tau_\perp$), any contribution to the transverse
magnetoresistance ratio $\rho(H)/\rho(0)$ which goes as $H^2$ must for a
cubic crystal be independent of the orientations of E and H, in the range
$\mu H/c > 1$, and any contribution which goes as H has only the modest
orientation dependence shown in Table XIII.

TABLE XIII
Orientation dependence of a hypothetical orbital-quantization term in
the transverse magnetoresistance which is proportional to H. The value
$r = m_\perp^*/m_\parallel^* = 0.052$ has been assumed.
[Columns: direction of j; direction of H; value of (40). The tabulated
entries did not survive extraction.]

We interpret our assumption in the same way as for item 2 above,
and with the same degree of risk. The proof is again quite simple if we
use the transformations (34) and (35). Since E is covariant, H
contravariant, $E \cdot H = 0$ implies $\hat E \cdot \hat H = 0$ in every
valley. When $\mu H/c > 1$, $E/j \approx RH$, and so, with j along x and H
along z,

$$\rho_\perp(H) \approx R^2H^2 \sum_i \left(\frac{\hat E^{(i)}}{E}\right)^2 \hat\sigma_\perp(\hat H^{(i)}), \qquad (38)$$


where, as before, $\hat\sigma_\perp(H)$ is the conductivity of the
simple-model sphericalized valley transverse to a magnetic field H, and
the summation is over the different valleys. In the absence of orbital
quantization, $\hat\sigma_\perp$ would go as $H^{-2}$, and so (38) would be
independent of the magnitude of H. A term in $\rho_\perp$ going as $H^2$
can arise only from a term in $\hat\sigma_\perp$ independent of H. If
$\delta\hat\sigma_\perp$ is such a term, the corresponding increment in
$\rho_\perp$ is

$$\delta\rho_\perp(H) = R^2H^2\,\delta\hat\sigma_\perp \sum_i \left(\frac{\hat E^{(i)}}{E}\right)^2. \qquad (39)$$

But the summation is a quadratic function of the components of E with
the symmetry of the crystal, hence it is isotropic for any cubic case. This
proves the first statement. A term in $\rho_\perp$ going as H must come
from a term in $\hat\sigma_\perp$ going as $H^{-1}$, so the relative
magnitudes of this term in various orientations must be proportional to
the values of the quantity

$$\sum_i \left(\frac{\hat E^{(i)}}{E}\right)^2 \left(\frac{H}{\hat H^{(i)}}\right). \qquad (40)$$

These values are shown in Table XIII.

4. Theory and experiment agree that the effect of orbital quantization on
longitudinal magnetoresistance in pure germanium is very small as long as
$\hbar\omega_c/kT \lesssim 1$ or so.

Calculations based on the simple-model theory of Argyres give $\rho(H)$
actually about 15 per cent below $\rho(0)$ for $\hbar\omega_c/kT = 1$,
rising to $\rho(0)$ for $\hbar\omega_c/kT = 2$ and increasing linearly at
high fields. The initial depression, however, is due to the fact that, in
this theory, the scattering is describable by a relaxation time
$\tau(\epsilon)$ which goes to zero when
$\epsilon = (n + \tfrac{1}{2})\hbar\omega_c$. Since $\tau^{-1}$ is
proportional to the density of states in energy, the mean value of
$\tau^{-1}$ in an energy range several times $\hbar\omega_c$ in width is
close to the value calculated without quantization. Because of the
fluctuations, the mean value of $\tau$ over such a range is greater than
the reciprocal of the mean of $\tau^{-1}$; hence the conductivity is
greater than in the absence of quantization, the current being carried
preferentially in the energy ranges with the highest $\tau$'s. Now these
fluctuations in $\tau$ will be considerably smoothed out if the energies of
the phonons responsible for the scattering are even a fraction of
$\hbar\omega_c$, and crude numerical calculations show that a reasonable
amount of such smoothing will change the maximum reduction of $\rho$ from
15 per cent to a few per cent. Thus, we expect theoretically that the
effect of quantization will be small for at least the range
$\hbar\omega_c/kT < 1$.

The highest values of $\hbar\omega_c/kT$ which have so far been obtained
experimentally for material with essentially acoustic scattering have been
those of Furth and Waniek for germanium in pulsed fields at room
temperature. Their maximum field of 460,000 gauss corresponds to
$\hbar\omega_c/kT = 2.1$. Up to this field, their $\rho(H)$ shows nice
saturation, agreeing fairly well with the predictions of the theory
ignoring quantization. By contrast, a combination of the latter curve with
the quantization correction computed for zero phonon energies gives a
curve which is rising conspicuously at the highest fields, and this is in
marked disagreement with the observations. Moreover, the saturation values
of longitudinal magnetoresistance in both the [100] and the [111]
directions agree with the predictions of the unquantized theory with
$w = m_\perp^*\tau_\parallel/m_\parallel^*\tau_\perp$ equal to about 0.057,
a fairly reasonable value.
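
For orientation, the dimensionless ratio used throughout this appendix can be evaluated from $\hbar\omega_c = \hbar eB/m_c$. The sketch below uses SI constants and takes the cyclotron mass as an input. The mass ratio in the example call is a hypothetical value chosen for illustration, since the text does not state which cyclotron mass underlies its quoted figures.

```python
HBAR = 1.054571817e-34         # J s
E_CHARGE = 1.602176634e-19     # C
K_BOLTZMANN = 1.380649e-23     # J/K
M_ELECTRON = 9.1093837015e-31  # kg
GAUSS_PER_TESLA = 1.0e4


def cyclotron_ratio(field_gauss, temperature_k, mass_ratio):
    """Return hbar*omega_c / (k*T) for a carrier whose cyclotron mass is
    mass_ratio * m_electron, in a magnetic field given in gauss."""
    b_tesla = field_gauss / GAUSS_PER_TESLA
    omega_c = E_CHARGE * b_tesla / (mass_ratio * M_ELECTRON)
    return HBAR * omega_c / (K_BOLTZMANN * temperature_k)


# Hypothetical illustration: 18,000 gauss at 77.4 K with an assumed mass ratio.
print(cyclotron_ratio(18_000, 77.4, mass_ratio=0.135))
```

The ratio scales linearly with field and inversely with both temperature and cyclotron mass, which is why low temperatures and light-mass orientations reach the quantum regime first.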

5. In the region $\mu H/c > 1$, $\hbar\omega_c/kT < 1$ and, for acoustic
scattering, the fractional changes in $Q_p(H)$ and in $\rho(H)$
attributable to orbital quantization either should both be $\ll 1$, or
else should be roughly equal.

Since the contribution of a phonon mode q to the Peltier flux is
proportional to $[c(q)]^2\tau_{ph}(q)$, where $c(q)$ is its velocity and
$\tau_{ph}(q)$ is its relaxation time, the statement just made will be
valid if we can show that the crystal momentum lost by a current to the
phonons is distributed in nearly the same manner among the different
modes, regardless of whether quantization is assumed or ignored.

It is easy to see qualitatively that, as $\hbar\omega_c/kT$ becomes $>1$,
the scattering of the electrons must involve phonons whose wave-vector
components $\perp H$ become very large. This will undoubtedly cause the
mean relaxation time of the phonons to become much shorter. Also, the
change in the average direction of the wave vectors of the contributing
phonons will alter their mean relaxation time if the individual
$\tau_{ph}(q)$ are anisotropic. Finally, if quantization changes the
relative contributions of transverse and longitudinal branches, the
average $[c(q)]^2\tau_{ph}(q)$ will be changed. We shall try to show,
however, that all these effects are rather small when
$\hbar\omega_c/kT < 1$.

Since the theory of quantum magnetoresistance is rather complicated
when $\hbar\omega_c/kT < 1$, we shall base our estimates on calculations
of the distribution of crystal momentum among the different phonon modes
for the case $\hbar\omega_c/kT \gg 1$. These calculations give the
asymptote of the curve of $(c^2\tau_{ph})_{av}$ against
$\hbar\omega_c/kT$, an asymptote which should be very nearly valid for
$\hbar\omega_c/kT = 3$. Since the curve almost certainly is monotonic and
comes in to its low-field limit with a horizontal tangent, one can
estimate the order of magnitude of its deviation from the low-field value
for $\hbar\omega_c/kT < 1$, or at least a rough upper bound to this. It
will suffice for our purposes to give the argument for a semiconductor
with isotropic effective mass, since we have shown above that, when the
scattering is nearly isotropic over an energy shell, the transport problem
for an anisotropic valley can be reduced to that for the isotropic case.

For E parallel to H the appropriate quantum transport theory is well
understood. A brief inspection of the formulas given by Argyres shows
that, for $\hbar\omega_c/kT \gg 1$, the crystal momentum in the direction
of the current which is delivered to the phonon modes of various wave
numbers q is proportional to


) — hg ; —hlqr T Qy ) 
» eX < Xx : : : (41) 
sali (ae) “*P | bm *w, 


where we have taken the z direction to be that of E and H. From this,
one can derive any kind of average which may be desired. Although our
main interest is in an average $\tau_{ph}$, hence in an average of
something like $q^{-1}$ or $q^{-2}$ with the weight (41), it will suffice
to compute the more easily evaluated averages of $q_x^2$, $q_y^2$,
$q_z^2$, noting that the average of $q^{-1}$ will differ by a somewhat
smaller percentage from its zero-field value than will that of $q^2$. It
is clear from (41) that the averages of $q_x^2$ and $q_y^2$ will go
proportionally to $\omega_c$, while that of $q_z^2$ will be independent of
$\omega_c$. The most convenient quantities to tabulate are the ratios of
these averages to the corresponding averages for $H = 0$; such ratios are
given for $\hbar\omega_c/kT = 3$ in the first row of Table XIV. They
differ by so little from unity that it seems safe to assume that, for
$\hbar\omega_c/kT < 1$, the average $c^2\tau_{ph}$ is very close indeed to
the value at $H = 0$.


It is worth remarking that the decrease of longitudinal resistance below
the zero-field value, which the theory predicts in the region
$\hbar\omega_c/kT < 2$, probably has no very noticeable counterpart in the
behavior of the $(c^2\tau_{ph})_{av}$ curve. This decrease arises from the
current being carried preferentially in the regions of energy just below
the values $(n + \tfrac{1}{2})\hbar\omega_c$, where the relaxation time is
longer than average. But the changes in the average $q_z^2$, etc., for
transitions at energy $\epsilon$ as $\epsilon$ passes through
$(n + \tfrac{1}{2})\hbar\omega_c$ are much less pronounced than those in
the relaxation time.

TABLE XIV
Ratios of the average $q_x^2$, $q_y^2$, $q_z^2$ of the phonon modes,
weighted in proportion to the crystal momentum they receive in the current
direction, to the corresponding averages in the absence of a magnetic
field. The ratios were evaluated from the high-field asymptotes for the
value $\hbar\omega_c/kT = 3$. For the transverse case the predictions of
two rival theories are given. Acoustic scattering has been assumed.
[Surviving entries only; the remaining entries did not survive extraction.]
Longitudinal, Argyres: 1.29
Transverse, Argyres: last column "→ 0"
Transverse, Klinger-Voronyuk: last column "→ 0"

Similar calculations can be made from the theory of transverse quantum
magnetoresistance, except that the correct form of the theory is still in
dispute. The alternative versions agree in predicting that for
$\hbar\omega_c/kT > 1$ the current should be carried predominantly in
states of low $k_z$, hence should involve transitions of low $q_z$. If the
levels had zero natural width and the phonons had zero energy, the
integral for the current in the E-direction would diverge. In the last two
rows of Table XIV, where we have tabulated the predictions of the two
types of theory, we have therefore used the entry "→ 0" in the last
column. The distribution of crystal momentum in the current direction
among the various modes turns out to be proportional, as far as $q_x$ and
$q_y$ are concerned, to the last factor of (41) in the Argyres theory, and
to $q_y^2$ times this factor in the theory of Klinger and Voronyuk. While
the latter theory spreads the $q_y$-distribution rapidly as H increases,
the over-all average $c^2\tau_{ph}$ reflects the combined behavior of all
three components of q, and it seems likely that, for
$\hbar\omega_c/kT < 1$, the average $c^2\tau_{ph}$ will not deviate from
its zero-field value by more than a small fraction of the latter.

By contrast, the values which these theories predict for the resistance
$\rho(H)$ are at least several times $\rho(0)$ for
$\hbar\omega_c/kT = 3$, for a substance like germanium.

6. The high-field transverse magnetoresistance is so sensitive to
inhomogeneities, surface conduction and other perturbing influences that it
cannot serve as a reliable indicator of the size of the
orbital-quantization effect.

At least, before it can be so used, many auxiliary investigations will
have to be made to ensure the elimination of such perturbing influences.
For the present purpose it will suffice to consider the case where the
normal bulk conduction is in parallel with conduction by some other
mechanism (e.g., surface conduction) which has a widely different Hall
effect, say zero. Slight inhomogeneities in the bulk conduction itself can
be shown to have a similar effect, though they are probably not of
major importance for our specimens.

Let the x-axis be along the length of the specimen, and let the magnetic
field H be in the z-direction. Let $\rho_b$ and $\rho_s$ be the effective
resistivities due respectively to normal bulk conduction alone and to the
perturbing conduction alone, and let $R_b$, $R_s = 0$ be the corresponding
Hall constants. Then we have, assuming for simplicity that $\rho_b$ and
$\rho_s$ are isotropic,

$$E_x = \rho_b j_{bx} - R_b H j_{by} = \rho_s j_{sx}, \qquad (43)$$

$$E_y = \rho_b j_{by} + R_b H j_{bx} = \rho_s j_{sy}, \qquad j_{by} + j_{sy} = 0. \qquad (44)$$

From (43) and (44) we get the effective resistivity

$$\rho_{eff} = \frac{\rho_s\left[\rho_b(\rho_b + \rho_s) + (R_bH)^2\right]}{(\rho_b + \rho_s)^2 + (R_bH)^2} \approx \rho_b\left[1 + \frac{(R_bH)^2}{\rho_b\rho_s}\right] \qquad (45)$$

if $\rho_s > R_bH > \rho_b$. The physical meaning of (45) is that the Hall
field due to the b-conduction is partially shorted by the s-conduction,
with consequent increase of the total dissipation. If $\rho_s$ is, say,
$200\rho_b$ but $R_bH/\rho_b$ is 7, the second term in the bracket in (45)
will be 0.25, and the effect on a parabolic extrapolation to infinite
field, such as we have made in Fig. 6, will be between two and three times
this.

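
The numerical example in this paragraph can be reproduced with a few lines. The sketch assumes the two-conductor model described above: an isotropic bulk conductor with resistivity $\rho_b$ and Hall term $R_bH$, in parallel with an isotropic conductor of resistivity $\rho_s$ and zero Hall effect, with no net transverse current. Under those assumptions the effective resistivity is $\rho_s[\rho_b(\rho_b+\rho_s)+(R_bH)^2]/[(\rho_b+\rho_s)^2+(R_bH)^2]$. The function names are illustrative.

```python
def effective_resistivity(rho_b, rho_s, rh_b):
    """Resistivity of bulk conduction (resistivity rho_b, Hall term
    rh_b = R_b * H) in parallel with a conductor of resistivity rho_s
    that has zero Hall effect."""
    numerator = rho_s * (rho_b * (rho_b + rho_s) + rh_b**2)
    denominator = (rho_b + rho_s) ** 2 + rh_b**2
    return numerator / denominator


def shorting_term(rho_b, rho_s, rh_b):
    """Second term in the bracket of the approximate form, which holds
    when rho_s >> R_b * H >> rho_b."""
    return rh_b**2 / (rho_b * rho_s)


# Example from the text, in units of rho_b: rho_s = 200, R_b H = 7.
exact = effective_resistivity(1.0, 200.0, 7.0)
approx = 1.0 + shorting_term(1.0, 200.0, 7.0)
print(f"exact rho_eff/rho_b = {exact:.3f}, approximate = {approx:.3f}")
```

With zero field the function reduces to the ordinary parallel combination $\rho_b\rho_s/(\rho_b+\rho_s)$, which is a convenient sanity check.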
The etching experiments described in Section V have shown that surface
treatment can alter the observed transverse magnetoresistance by a very
sizable amount at liquid-air temperature, especially for
$H \parallel [001]$, $j \parallel [100]$. The subject merits a much more
thorough investigation. However, it should be emphasized that the
high-field $\Delta Q$ effect is not comparably sensitive, either
experimentally or theoretically; this is, of course, because there is no
large Hall field in the $\Delta Q$ experiment.

Fig. 17. Values of the ratio of the transverse $\Delta Q(H)$ to $Q_p(0)$
for Sample 606 at 20.5° and 44°K, showing the presumed effect of orbital
quantization. The values of $|Q_p(0)|$ are 8600 μv/deg at 20.5° and 3730
at 44°.

7. At 20° and 44°K the effect of orbital quantization on the transverse
$Q_p$ can be identified in the data, and it is small. Since the mechanisms
of scattering of phonons are quite different at these two temperatures,
these small values can hardly be due to an accidental cancellation of
effects of quantization on $\rho(H)$ and on $c^2\tau_{ph}$, and it is
probably safe to conclude that at liquid-air temperature quantization
modifies $\rho$ and $Q_p$ by only a few per cent.

Fig. 17 shows the $\Delta Q$ data for Sample 606 at these two
temperatures. Both curves start to saturate as H increases, then turn
upward again. These upturns are undoubtedly due to orbital quantization;
one may also suspect the presence of a quantization term varying less
rapidly with H than the conspicuous part of the upturn, say linearly. One
would like to separate $\Delta Q$ into a nonquantization part that
saturates when $\mu H/c > 1$ and a quantization correction dependent on
$\hbar\omega_c/kT$. If, as the data suggest, a curve of the quantization
correction alone against H would be concave upward, then we must surely
underestimate the infinite-field limit without quantization if we assume
it to equal the intercept on the $H = 0$ axis of the tangent to the
$Q(H)$ curve at its most nearly horizontal portion. Thus, the difference
between this intercept and the ordinate of the curve at 18,000 gauss ought
to be an overestimate of the quantization correction. In this way we
estimate that at 20.5°K, the orbital quantization correction is <10 per
cent of $Q_p(0)$, and that at 44° it is <3 per cent. The reasonableness of
our partition of $\Delta Q$ into its two parts is attested by the values
obtained for what $\Delta Q/Q_p$ would be in the absence of quantization.
Our intercept at 44° corresponds (if $|\Delta Q_e| \approx 48$ μv/deg) to
a $\Delta Q_p/Q_p$ of 0.21, only slightly larger than the values found in
the liquid-air range. The intercept at 20.5° corresponds to a
$\Delta Q_p/Q_p$ of about 0.33; this exceeds the high-temperature value by
about the right amount to be attributed to the alteration of the
phonon-phonon scattering law (see next paragraph).

Now, at 20.5°K, the scattering of the phonons involved in $Q_p$ is almost
entirely due to the boundaries of the specimen, while at 44° the
phonon-phonon scattering exceeds the boundary scattering. Since these two
types of scattering have quite different dependences on frequency and
direction, it would not be reasonable to postulate that the effect of
quantization on the average relaxation time of the participating phonons
at both temperatures happens to be such as almost to cancel the effect
of quantization on the magnetoresistance. The scattering of the
electrons, incidentally, can be taken as approximately acoustic at both
temperatures, since at 20.5° the observed Hall mobility, 215,000 cm²/vs,
is about 0.8 of the estimated ideal value. We therefore conclude that,
with acoustic scattering, the effect of quantization on the transverse
$\rho(H)/\rho(0)$ is probably no more than 10 per cent or possibly 20 per
cent at 20.5°K. This conclusion is in violent contrast to the result of
direct magnetoresistance measurements at this temperature, which for this
sample (unetched!) gave $\rho(H)/\rho(0) = 1.8$ at 18,000 gauss. We
believe the discrepancy is to be attributed to the great sensitivity of
high-field transverse magnetoresistance to perturbing influences,
especially surface conduction, as discussed in Section V of the text.

From the figures given above and the plausible assumption that
above 44°K the orbital-quantization term decreases at least as fast as
$T^{-1}$, we conclude that, at liquid-air temperature and 18,000 gauss,
this term is probably no more than a few per cent of the resistivity
$\rho(H)$, and almost certainly is no more than this for the
thermoelectric power $Q(H)$ or for $Q_p(H)$. It should be remembered, of
course, that the effect of the quantization term on the extrapolation of
$\rho$ or Q to infinite field, made on a plot such as those of Fig. 6,
will be proportionally larger than the effect at the field at which the
extrapolation commences; for our parabolic extrapolation procedure, the
factor is a little under 2 for a term proportional to H and a little over
2 for a term proportional to $H^2$.

APPENDIX B. DERIVATION OF FORMULAS


In this appendix we shall give some of the intermediate steps involved
in applying the procedure outlined in Section III for the calculation of B
and $\Delta Q$ at low and high magnetic fields. Our objective in doing so
is partly to clarify the procedure used, but primarily to list a number of
intermediate formulas which are especially useful or illuminating, and to
give the final formulas in a form sufficiently general to be applied to
other cubic many-valley structures (e.g., silicon), and to cases where
the anisotropies are energy-dependent.

Our starting point is the electron-group picture, with each ellipsoidal
energy shell constituting a group $g$. The quantities we wish to calculate
are the components of the Peltier tensor $\Pi$, or, rather, the terms of order
1, $H$, $H^2$ in $\Pi$ at low $H$, and those of order 1, $H^{-1}$ at high $H$. For both
cases we shall base our calculations on (6) or (9), which relate $\Pi$ to
the partial conductivity tensors $\sigma_g$ of the groups. These $\sigma_g$, functions of
$\mathbf{H}$, are derivable from the transport equation (10) for group $g$, namely,


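$$\tau^{-1}\cdot m^{*}\cdot d\mathbf{j} \pm \frac{e}{c}\,d\mathbf{j}\times\mathbf{H}
  = e^{2}\,\frac{\epsilon}{\langle\epsilon\rangle}\,\mathbf{E}\,dn, \qquad (46)$$


where the upper and lower signs, it will be remembered, are for electrons
and holes, respectively.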


B.1 Small H, Basic Formulas


The conductivity tensor $d\sigma$ of an energy shell is the coefficient
of $\mathbf{E}$ in the solution of (46) for $d\mathbf{j}$. For $H \to 0$ the usual way to expand
$d\sigma$ in powers of $H$ is by iteration. In a coordinate system oriented along
the principal axes of the valley we find for the parts containing $H^0$, $H^1$,
$H^2$ respectively:


— ) ‘3 bas,H, dn, (48) 


Ma *ms* 


where $\delta_{\alpha\beta\gamma}$ is $\pm 1$ when $\alpha$, $\beta$, $\gamma$ are even (odd) permutations of 123, and
is 0 otherwise; and


+2 ( a ) bar Sar. Ts ‘ 
5 / 


Na *Mg*m*


B.2 Value of Q at H = 0


Let us start by computing $Q$ for $H = 0$ in terms of the principal
components $\Pi_\parallel(\epsilon)$, $\Pi_\perp(\epsilon)$ of $\Pi_g$. From (6), (1) and (47) we have, for a
cubic crystal with axially symmetrical valleys,


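$$TQ^{0} = \frac{\sum_\alpha \int \Pi_{g\alpha\alpha}\,d\sigma^{0}_{g\alpha\alpha}}{\sum_\alpha \int d\sigma^{0}_{g\alpha\alpha}}
  = \frac{\langle\epsilon\Pi_\parallel\tau_\parallel\rangle/m_\parallel^{*} + 2\langle\epsilon\Pi_\perp\tau_\perp\rangle/m_\perp^{*}}
         {\langle\epsilon\tau_\parallel\rangle/m_\parallel^{*} + 2\langle\epsilon\tau_\perp\rangle/m_\perp^{*}}, \qquad (50)$$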


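where, as always, the angular brackets denote Maxwellian averages. This
equation, of course, also holds for $Q_e$ and $Q_p$ separately, with $\Pi_{e\parallel}$,
$\Pi_{e\perp}$ or $\Pi_{p\parallel}$, $\Pi_{p\perp}$. When the anisotropies $p = \Pi_{p\parallel}/\Pi_{p\perp}$ and
$w = m_\perp^{*}\tau_\parallel/m_\parallel^{*}\tau_\perp$ are energy-independent, the expression for $Q_p$ reduces to

$$TQ_p(0) = \frac{\langle\epsilon\tau_\perp\Pi_{p\perp}\rangle}{\langle\epsilon\tau_\perp\rangle}\,\frac{2 + pw}{2 + w}. \qquad (51)$$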


When $\Pi_p$ and $\tau$ depend on energy according to simple power laws, the
Maxwellian averages occurring in these and subsequent expressions can
be evaluated from Table XV at the end of this appendix.


B.3 The Low-Field Value of B


From (1), (5) and (6) we have, if $\mathbf{H}$ is in the $z$ direction,


TBH = Ni [24 5 ] p 


If the crystal has over-all cubic symmetry, the evaluation of the first
term on the right of (52) can be greatly simplified by noting that it is
isotropic and may therefore be replaced by its average over all
permutations of the directions of $x$, $y$ and $z$ (keeping the system right-handed, of
course). If this averaging is done before the summing on valleys, the
result will be the same for all valleys. For a single valley it can be
evaluated by taking the coordinate axes along the principal axes of the
valley. We find in this way, using (48),


[2 M6," ) : at ea lls eryra ll, 


+ 
ICE my my "my 


where $n$ is the total carrier density and, as usual, the angular brackets
represent Maxwellian averages. In the last term of (52) we have, from


(ha). 


a } wall 


p ( 


where $\mu_H$ is the low-field Hall mobility. We may use these and the
expressions of Ref. 3, also obtainable from (47) and (48),


Dy, 


€T 


TB 


This again applies to $B_e$ and $B_p$ separately, as well as to their sum, if
$\Pi_g$ and $Q$ are given the appropriate subscript. When the anisotropies
are independent of energy the formulas of Table I are easily derived from
this with the use of (51). Note that the sign of $B$ is independent of the
sign of the carriers, since for electrons the upper sign is to be used in
(57) with negative $\Pi$'s and $Q$, while the reverse obtains for holes.

All the formulas we have written thus far apply to any cubic material
with axially symmetrical valleys.


B.4 General Formulas for AQ at Small H 


For the low-field $\Delta Q$ we must take the second-order part of (9) and
use (48) and (49). Alternatively, we can eliminate the $d\sigma^{(1)}$'s in favor of
$B$. For the second-order part of the thermoelectric power in the $\alpha$
direction we find, for extrinsic material,


Qaa Ge [I,-0,"lee + Q° eo 
| - ' 
t G (#2) cp (#) ir = H. ) 
Cc c 


with the usual convention of upper sign for n type, lower sign for p. 


(58) 


The quantity paa /p is just the magnetoresistance. Our task is now to 
compute the first term of (58) for various orientations.

Since in cubic material three phenomenological constants suffice to
describe $Q^{(2)}$ for all possible orientations of $\mathbf{H}$ and $\nabla T$, only three
evaluations of (58) need be made. We can show further that when the valleys
have [100] or [111] orientations in the Brillouin zone, their axial symmetry
alone gives a further relation between the three constants, at least to
the accuracy of the electron-group approximation, so that only two
orientations need be computed. The argument, which is a generalization of
the corresponding one for magnetoresistance, was given in Appendix C
of our first paper.’ In terms of the q , qe, gu defined under Table IV the 
results are 


-q for |111] valleys, 59) 
0 for [100] valleys. (60) 


The evaluation of (58) can take a variety of forms, depending on
whether $Q^{0}$ and $B$ are expressed explicitly in terms of the $\tau$'s and $\Pi$'s,
or are left as themselves. These forms can be converted into one another
by use of (57) etc., and of
pi 
Paa 7 g 


p a\” 


UF aa 
(4 | eam. 
C, 0 


The expressions for $\sum_g \sigma^{(2)}_{g\alpha\alpha}$ can be taken from the magnetoresistance
literature, or (for [111] valleys) from (65) below or its longitudinal
modification, by setting $\Pi = 1$.


B.5 $\Delta Q_e$ at Small H with Energy-Independent Anisotropies


As with the Nernst coefficient, the evaluation of $Q_e^{(2)}$ for the case of
an energy-independent anisotropy of $\tau$ is a little easier than that of $Q_p^{(2)}$.
When $\mathbf{H}$ is parallel to the direction of measurement (the $\alpha$ direction),
the last term of (58) drops out and, with $\Pi_e$ replaced by $\mp\epsilon/e$, the ratio
of the first term of (58) to the second is a function only of the energy
dependence of $\tau$. Using (61) we find easily, with $\rho^{(2)} = \Delta\rho$ and with the
upper sign for electrons, the lower for holes,


9 dK bey er” j Ap ; , : 
AQ. = Qeaa = —— (aw longitudinal). (62) 
« » / 5 
Ze | (er)(e) er®)(e€) | p(Q) 


For transverse or general orientations of H we must use (57) to evaluate 
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the last term of (58) and must modify the relation of — too we 


little algebra gives 


AQ @) : 3k (er) v (er) Ap 

- ciao 2 (er )(€) (er®)(e)/ p(O) 
\ CS ay ) (4) eo ao ne)), 

(eT )e (eT XE c 


These formulas are valid for any multivalley band structure in a cubic
crystal. Still more general expressions for $\Delta Q_e$, valid for [111] valleys if
the anisotropy of $\tau$ is not energy-independent, could be obtained from
(65) below.


(63) 


B.6 $\Delta Q$ and $\Delta Q_p$ at Small H


Because of the anisotropy of $\Pi_p$, the first term of (58) is a little more
difficult to evaluate for the phonon-drag contribution or for the total
$\Delta Q$. We may again eliminate the sum over different valleys by noting that
$\Delta Q$ is unchanged if we apply any operation $D$ of the cubic symmetry
group both to $\mathbf{H}$ and to $\nabla T$, the $\alpha$-direction in (58). If we average the
contribution of each valley over all such $D$, the result will be the same
for all valleys. If $\mathbf{u}^{(T)}$ is the unit vector in the direction of $\nabla T$, the result
can be written, in the principal-axis system of some one valley,


‘ 
, a 1 ne (ellataTstn i 
u >? \1I,:6é,° |-u — Y ks 379X085 
C~ aBrys (€E)Maq Mg My 


T (T 
(AH sta ug Ds 


where $\langle\ \rangle_D$ means the average of the components shown over all sets,
$D\mathbf{H}$, $D\mathbf{u}^{(T)}$.

So far, the equations are valid for any cubic multivalley material. We
shall now specialize to valleys along the [111] axes of the Brillouin zone.
For this case and for $\mathbf{H}$ along a [100] axis, $\nabla T \perp \mathbf{H}$, (64) can be evaluated
to


, ne H° 
u”- >) (M,-6,"]-u” = ——, 


C 


2 (ellyry"72) 1 (ell yryra" 
re 


9 (eymy*?m,* 9 (e)my*m,* 


9 (eymy*m,*2 = =9 (e)m,™ 


" 1 (ella ~) + 3 are) |. 


When H and ∇T are both along a [100] axis we find an expression of
similar form to (65) but with the coefficients %, 4, $, 3 replaced
respectively by §, -§, -%, §. These expressions apply, of course, to either the
electron or the phonon-drag contribution, or to their sum; one need only
affix the proper subscript to all Π's.

For the most general case the expression for $\Delta Q$ in terms of moments
of $\Pi_g$ is obtained by inserting (65), or its longitudinal modification,
into (58); one may or may not wish to express the $Q^{0}$ and $B$ occurring
there by (50) and (57). The magnetoresistance can be expressed if desired
by (61) and (65) with $\Pi = 1$, and $\rho^{0}$ and $\mu_H$ by (55) and (56).
The last entries in Table IX were obtained in this way.

If we assume that $p = \Pi_{p\parallel}/\Pi_{p\perp}$ and $w = \tau_\parallel m_\perp^{*}/\tau_\perp m_\parallel^{*}$ are
energy-independent, we get, after a little reduction,


OOl 2 i 2 3 4 
| , ae ey (er )(€T (er) (ellr ) = 
AG. @, . +r — , (bob) 
él. ; ( c E 5 + 2 ( (er )° (ellr (er" 7 
Ol » 
40, | Q, Ap _ Q, ea 
OO1 p Cc 


2(2 + w)' (1 — w)(1 — pw) (er “(ell 


5 (2 + pw)(1 + 2w)? (ellr)(er?? 


0 pol \ 2(2+ w)(1 — w)] (1 — w)er)(er 
oe c 4 (1 + 2w)? (er-)° 


(2 + w)(1 — pw) (er) (ell 
(2 + pw) (ellr)(er?)” 


It is perhaps worth remarking that there exist a variety of checks
which can be applied to calculations of this sort. For example, $\Delta Q$ should
always vanish if all $\Pi_g$ are the same multiple of the unit tensor. Again,
setting $\Pi \propto m^{*}\cdot\tau^{-1}$ should give the magnetoresistance.' These and other
tests of the same sort are very helpful in uncovering algebraic errors.


B.7 Large H, Basic Formulas


The leading term in the expansion of the solution of (46) in powers of
$H^{-1}$ is especially simple when the geometry is such that $\mathbf{E}$ is either
exactly parallel or exactly perpendicular to $\mathbf{H}$. As these cases include all
the orientations dealt with in this paper, we shall give first some formulas
based on these specializations.

When $\mathbf{E}$ and $\mathbf{H}$ are parallel, the part of $d\mathbf{j}$ normal to $\mathbf{H}$ must be of
order $EH^{-1}$, since, if it were of lower order in $H^{-1}$, the term of (46) in
$d\mathbf{j}\times\mathbf{H}$ could not be compensated by anything else. Therefore, only the
part $dj_H$ of $d\mathbf{j}$ parallel to $\mathbf{H}$ can be of order $EH^{0}$. It is obtained by taking
the component of (46) along $\mathbf{H}$:

$$dj_H = \frac{e^{2}\,\epsilon\,E\,dn}{\langle\epsilon\rangle\,(m^{*}\cdot\tau^{-1})_{HH}}, \qquad (68)$$

with neglect of terms of order $H^{-1}$.

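When $\mathbf{E}$ and $\mathbf{H}$ are perpendicular, as they always are when $\mathbf{H}$ is
perpendicular to a symmetry plane and the total $\mathbf{j}$ is in this plane, we can
start the solution of (46) by taking its cross product with the unit
vector $\mathbf{u}^{(H)}$ in the direction of $\mathbf{H}$. The Hall field now dominates $\mathbf{E}$, so all
components of $d\mathbf{j}$ are of order $EH^{-1}$, and the component normal to $\mathbf{H}$
measures in essence the Hall velocity $cEH^{-1}$. We find, to the first order
in $H^{-1}$,

$$(\text{part of } d\mathbf{j} \perp \mathbf{H}) = \pm\,ec\,\frac{\epsilon}{\langle\epsilon\rangle}\,\mathbf{u}^{(H)}\times\mathbf{E}\,H^{-1}\,dn, \qquad \mathbf{E}\perp\mathbf{H}, \qquad (69)$$

where the upper sign is for electrons, the lower for holes. The part of $d\mathbf{j}$
parallel to $\mathbf{H}$ is now obtained by taking the dot product of (46) with
$\mathbf{u}^{(H)}$ and using (69). The result of combining the two is, to the first
order,

$$d\mathbf{j} = \pm\,ec\,\frac{\epsilon}{\langle\epsilon\rangle}\,EH^{-1}\,dn
  \left[\mathbf{u}^{(H\times E)} - \frac{(m^{*}\cdot\tau^{-1})_{H,\,H\times E}}{(m^{*}\cdot\tau^{-1})_{HH}}\,\mathbf{u}^{(H)}\right],
  \qquad \mathbf{E}\perp\mathbf{H}, \qquad (70)$$

where now we have used the label $H\times E$ in subscripts and superscripts
to designate the corresponding coordinate direction, just as we used $H$
above; $\mathbf{u}^{(H\times E)}$ is the unit vector in this direction.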

The expressions (68) and (70) will suffice for the calculation of the
high-field limit of $Q$; to get the high-field behavior of the Nernst effect,
however, for Table II, we need the conductivity tensor to one higher
order. Rather than writing this down explicitly, we shall outline the
result of a systematic solution of (46) by iteration of the procedures just
used. Let us write for the conductivity tensor of the shell

$$d\sigma(H) = d\sigma^{(\infty)} + d\sigma^{(-1)} + d\sigma^{(-2)} + \cdots, \qquad (71)$$

where the superscripts $\infty$, $-1$, $-2$, etc. denote respectively terms of
zeroth, first, second, etc. orders in $H^{-1}$. With a corresponding notation
for the components of $d\mathbf{j}$, we find that, for $k > 1$, the part of $d\mathbf{j}^{(-k)}$
perpendicular to $\mathbf{H}$ can be obtained in terms of $d\mathbf{j}^{(-k+1)}$ by taking the
cross product of (46) with $\mathbf{u}^{(H)}$, and the part parallel to $\mathbf{H}$ from both this
result and the dot product of (46) with $\mathbf{u}^{(H)}$. The general term can be
written concisely but abstractly in the form
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+c , € 
ee (37) “( )(G-u-s) 1.G-U-Gtdn, (72) 
eH \é) 


if $k \ge 1$; the direction of $\mathbf{E}$ is now unrestricted. Here $S$ stands for the
tensor $m^{*}\cdot\tau^{-1}$, while


Y ‘ ; 1 H Ko 
Gas Oap — Onn Ua > Un OrB ; 


G° is the transposed tensor, and 


| , e H 
By Osyyully ° 
m 


By use of the identity G'S SG one can easily show that (72) is
symmetrical for even $k$, antisymmetrical for odd, as it should be.


B.8 Limits of $Q_e$ and $Q_p$ at Large H


For either $Q_e$ or $Q_p$, or their sum, we have, from (6) and the Kelvin
relation, the high-field limit $Q^{(\infty)}$ given by


> (M,: 6," ‘O° laa Tt > (1,6, 


where $\alpha$ labels the rectangular direction in which $Q$ is measured, and
where, by analogy with the notation already used, $\rho^{(\infty)}$ is the limiting
resistivity tensor at infinite $H$ and $\rho^{(+1)}$ is the high-field Hall tensor (12).

When the $\alpha$ direction is parallel to $\mathbf{H}$, the only nonvanishing
component of $\sigma_g^{(\infty)}$ is, by (68), the $\alpha\alpha$ one. Moreover, in this case $\rho^{(+1)}_{\beta\alpha}$
vanishes for all $\beta$, so only the first term of (75) remains. In this term we
can replace $\rho^{(\infty)}_{\alpha\alpha}$ by $(\sum_g \sigma^{(\infty)}_{g\alpha\alpha})^{-1}$, so we have


‘gi 


«aa 


where $i$ labels the different valleys.

If the different components of $\tau$ all have the same energy dependence,
the relative magnitudes of the currents (68) for shells of different
energies will be the same as for $H = 0$, so, as (76) and (50) show explicitly,

$$Q_e^{(\infty)} = Q_e(0), \qquad \text{parallel to } \mathbf{H}. \qquad (77)$$


If the valleys have [111] orientations in a cubic crystal, the general 
form (76) becomes, for symmetry directions, 
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(e(I1 py + 211 pa} [(my*/ry) + 2(mi*/r2)] y 


, 
* 


‘ ‘ l 
3¢e[(my*/7)) + 2¢my,*/r2)] ) 


f ‘ ‘ 1 
(lM para/ma*) + (e[Mpa + 2p) [(ma*/7.) + 2(m)*/r,) 


. ‘ 1 
(er, /my*) + 3¢e[(ma*/7.) + 2(my*/7))] ~) 


m)*) + 3¢elI1 py + SI pal[(my*/7y) + 8(m.*/7,)] 
Oo” , 1 
(er /my*) + 27(e[(my*/7) + 8(m.*/7.)] +) 


As usual, these equations also hold for $Q_e$ and $Q_p$ individually, if $\Pi_\parallel$ and
$\Pi_\perp$ are given the corresponding subscript. Further simplifications are
possible if the anisotropies are energy-independent, as assumed in Table
III, or if the 7’s are xe ete. 

When the $\alpha$ direction is perpendicular to $\mathbf{H}$ and $\mathbf{H}$ is normal to a
symmetry plane, (70) suffices for the evaluation of (75). For this case, the
first term of (75) vanishes, since $\sigma_g^{(\infty)}$ has only an $HH$ component and
$\rho^{(\infty)}$ has no $\alpha H$ component. We find easily


’ x r 1 oe ) , 
TQaa Ny '(e) 2 (th. on Z )tas Va ak H), (S1) 
v 


S HH 


where $i$ runs over the different valleys, $N_V$ in number, and where, as
above, $S^{(i)}$ is the tensor $m^{*}\cdot\tau^{-1}$ for the $i$th valley. This holds for any
multivalley structure when the symmetry is such that $\mathbf{E}$ is exactly $\perp \mathbf{H}$.
The entries of Tables III and IX are specializations of (81).


B.9 Asymptotic Behavior of B as $H \to \infty$


Let $\mathbf{H}$ be in the $z$-direction, $\nabla T$ in the $x$-direction. From (1), (5),
and the symmetries which we proved above from (72), we find


_| 0(BH) 2 1 (-1 2% i Ys 
re |, | Dt ba i if _ 


where, it will be remembered, the superscripts $-2$, $-1$, $\infty$, $+1$ denote
quantities going as $H^{-2}$, $H^{-1}$, $H^{0}$, $H$ as $H \to \infty$. Here we may use (12)
for p\*" and get o ” from (70). Since p*” has only zy and yx com- 
ponents, the first term on the right of (82) involves just ( } ae ers Vics 
Use of the explicit evaluation of 6,° ~ dé” from (72) gives 
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ce dn | ae 7] ’ Sasiys 
~ F79 ory oy 7 Sry — 5 
H*{e) : S22 ; Was 
Si: py 
ae (te. an Moz: *) (s a 
f S.. uy z 


where, as above, $S$ is the tensor $m^{*}\cdot\tau^{-1}$ for group $g$. If the directions of
$\mathbf{H}$ and $\nabla T$ are such that $\rho^{(\infty)}$ is diagonal in our coordinate system, the
second term on the right of (82) involves $(\Pi_g\cdot\sigma_g^{(-1)})_{xy}$, for which we
find
ce dn | a 
( ; 1 , cee ¢ a5 ee z R 
I, -dé a - ( Hess a} (84) 


where, as always, the upper sign is for electrons, the lower for holes. By 
the use of (50) and the relation 


we can evaluate (82) in the form. 


1 [ a(BH) ee a 
Q(0) Ld(c/uH) Jo 





(Si + 28,-") 


9) 
(HY) ow, 
S S, 


where the superscript $i$ labels the different valleys, $N_V$ in number. This
equation applies to $B_e$ or $B_p$ or their sum, if all $\Pi$'s, $B$ and $Q$ are given the
appropriate subscript; it is valid for any multivalley structure with
over-all cubic symmetry if $z$ is normal to a symmetry plane and $x$ is
along a two-fold or four-fold axis in the plane. The entries in Table II
result from tedious but straightforward specializations of (86) to [111]
valleys and energy-independent anisotropies.


B.10 Maxwellian Averages 


As an appendix to this appendix we list here the Maxwellian averages
of various powers of the energy, which are needed for the evaluation of
the various expressions we have derived when the $\tau$'s and $\Pi$'s are
assumed to have power-law dependences on energy. Since the number of
Maxwellian carriers in the energy range $\epsilon$ to $\epsilon + d\epsilon$ is proportional to
$\epsilon^{1/2}\exp(-\epsilon/kT)\,d\epsilon$, we have for any quantity $Z$

$$\langle Z\rangle = \frac{\int_0^\infty Z\,\epsilon^{1/2}\exp(-\epsilon/kT)\,d\epsilon}{\int_0^\infty \epsilon^{1/2}\exp(-\epsilon/kT)\,d\epsilon}.$$

The results for various powers of the energy are given in Table XV.

TABLE XV. MAXWELLIAN AVERAGES OF POWERS OF ENERGY
[entries not legible in the source]
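For power-law dependences the averages defined above reduce to ratios of gamma functions. The following is a minimal sketch in Python, assuming the weight $\epsilon^{1/2}\exp(-\epsilon/kT)$ of the definition above; the function names and the exponent label `s` are ours, introduced only for illustration, and the numerical routine serves merely as a cross-check of the closed form.

```python
import math


def maxwellian_average_power(s: float) -> float:
    """Return <(eps/kT)^s> for the weight eps^(1/2) * exp(-eps/kT).

    Closed form: Gamma(s + 3/2) / Gamma(3/2), which converges for s > -3/2.
    """
    if s <= -1.5:
        raise ValueError("the average diverges for s <= -3/2")
    return math.gamma(s + 1.5) / math.gamma(1.5)


def maxwellian_average_numeric(z, x_max: float = 60.0, steps: int = 200_000) -> float:
    """Average z(x), with x = eps/kT, over the same weight by the midpoint rule."""
    h = x_max / steps
    num = 0.0
    den = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        weight = math.sqrt(x) * math.exp(-x)
        num += z(x) * weight
        den += weight
    return num / den


if __name__ == "__main__":
    for s in (-0.5, 0.5, 1.0, 2.0):
        print(s, maxwellian_average_power(s))
```

A product such as $\langle\epsilon\tau\rangle$ with $\tau \propto \epsilon^{r}$ is then proportional to `maxwellian_average_power(1 + r)`, the constant of proportionality cancelling in ratios like those of (50).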
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Stabilization of Silicon Surfaces by 


Thermally Grown Oxides*
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A study has been carried out of the stability of silicon surfaces when they 
are provided with a chemically bound solid-solid interface. Stable surfaces 
have been obtained with the system silicon-silicon dioxide when the oxide is 
thermally grown. This latter system has been studied in some detail. In this 
paper the following phases of our investigation are presented: (i) some
aspects of the thermal oxidation process and properties of the oxide; (ii) the
electronic properties of the resulting silicon-silicon dioxide interface; (iii)
the application of the process to devices and resulting device characteristics.


I. INTRODUCTION


In this introduction we will give a qualitative discussion of the
problem of stabilization of semiconductor surfaces.


1.1 Atomically Clean Surfaces 


Fig. 1(a) is a schematic diagram of a clean [100] silicon surface
showing the surface dangling bonds or unfilled orbitals. Since two electrons
can occupy a free orbital, surface atoms may become negatively charged,
thus acting as acceptor surface states [Fig. 1(b)]. The existence of these
states in crystals was first proposed by Tamm,¹ based on a special
one-dimensional model. A more general treatment by Shockley² showed that
surface states can occur only if there is a separate potential trough at the
surface or if the energy bands arising from separate atomic levels
overlap. He further concluded that surface states should occur in the
forbidden gap and their number should be approximately equal to the number
of surface atoms.

* This work was supported in part by the Department of the Army under
Contract DA 36-039 sc-64618.

Experimentally, therefore, one expects to find that an atomically clean
surface would be strongly p-type, corresponding to a surface state
density roughly equal to the density of surface atoms [Fig. 1(c)], and strongly
sensitive to surface effects such as chemisorption. Measurements on
germanium surfaces prepared by the Farnsworth® technique (argon 
bombardment, then annealing at 500°C in high vacuum), of work
function,’ ®® photoconductance,?* "9 surface conductivity?!" and field
effect?’ have indicated that there is a large density of surface states
and that these surface states are the dangling bonds which act as accep- 
tor-type states by trapping bulk electrons and forming a p-type surface. 
The data also indicate that the adsorption of gases which can bond
covalently with these unfilled orbitals affects the density of surface states
and the electrical properties of the surface. More recent measurements,!”
still only preliminary, on cleaved germanium surfaces in high vacuum,
appear to be in agreement with those obtained on bombarded and
annealed surfaces.

Fig. 1 (a) Atomic diagram of [100] surface; (b) surface states; (c) resulting
band bending near the surface of an atomically clean intrinsic material.

Measurements on atomically clean silicon surfaces are not yet as com- 
plete as are those on germanium surfaces. There is strong evidence that 
clean silicon surfaces are obtainable by high-temperature, high-vacuum 
heating, as indicated by field emission patterns,'*"* secondary electron 
emission!’ and low energy electron diffraction.!® From the nature of the 
surface states on germanium surfaces as outlined above, one would ex- 
pect considerable resemblance between clean germanium surfaces and 
silicon surfaces. 

From the standpoint of device technology, one may expect that truly 
clean surfaces are not desirable. A junction device would not be operative 
if its surfaces were atomically cleaned, since the resulting surfaces would 
be strongly p-type (as present evidence indicates), regardless of the body 
doping, and a low-resistance surface path would shunt the device. 

1.2 Practical Surfaces 

It is obvious from the above discussion that the usual processes of 
“surface cleaning” as used in device preparation, such as etches or rinses, 
do not provide “‘surface cleaning” in its true sense. A more appropriate 
term is ‘‘surface doping,”’ since such processes generally provide surface 
structures, complex and unknown, which are found empirically to pro- 
duce surface properties compatible with the device body properties 
(through the attainment of a suitable density, distribution and type of 
surface states, and hence the term surface doping). 

Fig. 2(a) shows a possible distribution of surface states on an
atomically clean surface. These are all acceptor-type states (neutral when
unoccupied, negative when occupied). Chemisorption of some foreign
species on the surface may produce the following changes [see Fig. 2(b)]:
(a) removal of some of the original acceptor states, (b) introduction of
new acceptor and donor states (neutral or positive) and (c) introduction
of outer states located beyond the interface, within or on the surface of
a grown film. Due to their usually longer relaxation times, these are called
"slow" states, in contrast to the interface or "fast" states [see Fig. 2(c)].

For a surface in equilibrium with the body, the potential distribution 
near the surface is determined by the density, distribution and type of 
all the surface states. In contrast to the atomically clean surface, which
develops a strongly p-type surface conductance [Fig. 1(c)], the surface
conductance of a practical surface may be either p-type or n-type,
depending on the existing surface states as determined by surface
treatment. This important concept was first appreciated by Bardeen.” Fig. 3
illustrates potential distributions at the surfaces of n-type and p-type 
materials when donor-type surface states predominate, hence producing 
a net positive surface charge which must be compensated by an equal 
negative charge in a surface space-charge layer. 

A point of particular interest and basic significance follows from the
above. If a certain surface treatment produces a predominance of either
donor- or acceptor-type states (irrespective of body doping), it is evident
that the surfaces near the various regions of a device (p and n) can, at
best, provide a compromise compatible with the device. For instance, for
a p-n junction with predominantly donor-type surface states (net
positive charge), the tendency is to make the p-region less p (and possibly
inverted), while the n-region becomes more n. For a multiple junction
device it is obvious that the problem of simultaneous compatibility of
the surface with the various regions may become quite critical,
particularly with high-resistivity materials. This point will be considered in more
detail later.

Fig. 2 (a) Surface states on an atomically clean surface; (b) interface or fast
states on a practical surface; (c) outer or slow states on a practical surface.


1.3 Nature of the Surface Stability Problem 


Practical surfaces obtained by the usual empirical techniques involving
etches, rinses, etc., are known to show various degrees of instability.
They are generally sensitive to various ambients producing drifts in
surface properties that are usually not completely reversible. There are at
least three main reasons for surface instability:

i. Replacement of interface impurities where an impurity B may re- 
place a previously existing surface impurity A, depending on their rela- 
tive affinities for the semiconductor. This, in general, will modify the 
distribution and density of the interface or fast states and, accordingly, 
the surface potential. 

ii. Ionization of adsorbed neutral impurities. This requires both the
availability of active sites at the free surface and electronic exchange
across a surface film between the adsorbed species and the semiconductor.
Depending on its relative electronic affinity, a neutral molecule located
at an active surface site (constituting an electronic state) will either
receive an electron from the semiconductor, producing a negative surface
charge, or give up an electron to the semiconductor, producing a
positive surface charge. Tunneling or direct conduction are probably the
mechanisms responsible for the electronic exchange. These (i and ii) are
the outer or slow states.

iii. Replacement of these outer impurities may also occur, producing
a change in the density, distribution and type of the slow states.

Fig. 3 Band bending at the surfaces of p and n semiconductors due to surface
states that are predominantly donor-type.

In addition to the above mechanisms, one might also consider a
distinctly different process of instability which is generally operative in the
vicinity of a junction. Here, surface ions produced by processes other
than electronic exchange with the semiconductor, such as hydrolysis,
etc., can migrate on the surface under the influence of the fringing field
of the junction. A second paper to be published is entirely devoted to a
detailed study of this process.


II. TECHNIQUES FOR MEASUREMENT OF SURFACE STATES


In the course of this work several techniques have been used for meas- 
uring the detailed surface properties of silicon surfaces, particularly as 
related to surface states. These included the large-signal ac field effect
technique,'*'* the large-signal ac field effect technique with superimposed
dc bias, a zero-frequency four-point probe field effect technique, the
surface photovoltage technique?® and the n-p-n or p-n-p channel
technique.”!.* The majority of the results presented here, however, have been
obtained by the field effect techniques. It seems in order, therefore, to
present here a brief review of the basic principles and concepts underlying
field effect work.


2.1 Basic Concepts of the Field Effect


The object of this technique is to determine the density and distribu- 
tion of surface states. Application of a field transverse to the semicon- 
ductor surface produces a change in the band bending near the surface 
to maintain charge neutrality. This, in effect, alters the location of the 
Fermi level at the surface and, accordingly, changes the occupancy of 
the surface states. One measures changes in surface conductance with 
field (or plate voltage), from which the density and distribution of states 
are obtained. 


2.1.1 Field Effect with No Surface States 


Fig. 4(a) shows the bands near the surface of an intrinsic
semiconductor which are straight in the absence of surface states or charge on the
field plate. The conductance of a region near the surface is due to
electrons in the conduction band and an equal number of holes in the
valence band. When a voltage is applied to the plate with respect to the
semiconductor the bands near the surface must adjust to the point where
the net charge in the semiconductor is equal and opposite to that on the
plate, Fig. 4(b). If $\Delta N$ and $\Delta P$ are the changes in the number of
electrons and holes per unit area, due to plate charge $Q_p$ per unit area, the
corresponding change in surface conductance $\Delta G$ is given by

$$\Delta G = q(\mu_n\,\Delta N + \mu_p\,\Delta P). \qquad (1)$$


One defines the derivative $dG/dQ_p$ as the "field effect mobility," $\mu_{FE}$.

It can be shown" that $\Delta G$ exhibits a minimum value that corresponds to
a unique potential distribution near the surface for each body resistivity.
For intrinsic material, the bands are bent by an amount $Y = -\ln b$,
where $b = \mu_n/\mu_p$. If the bands are so strongly bent in either direction
that the number of one type of carrier is negligibly small as compared
with the number of the other, then a change in field-plate charge will
induce an equal and opposite change in the majority carrier and the field
effect mobility approaches the surface mobility of that carrier. Fig. 4(c)
shows the variation of conductance with plate charge (equal and
opposite to space charge $Q_{sc}$) with the corresponding values of surface
potential $Y$ [in $kT/q$ units]. In Fig. 4(d) the changes in field effect mobility
with surface potential are shown. For any other resistivity one can
calculate the variation in surface conductance with charge and the
corresponding surface potential using the space-charge equations derived by
Garrett and Brattain,® which are also conveniently available in graph
form.4 In many cases, one needs to include the effect of surface
scattering on carrier mobility. No direct measurements are available, however,
and the common practice is to apply Schrieffer’s theoretical results.*®

Fig. 4 Field effect in the absence of surface states.
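The statements above about the conductance minimum and the limiting field effect mobility can be checked numerically for intrinsic material. The sketch below is illustrative only. It assumes the standard Boltzmann space-charge solution for an intrinsic semiconductor, in which the excess electrons and holes per unit area are proportional to exp(Y/2) - 1 and exp(-Y/2) - 1, and it ignores surface scattering. These closed forms, the normalization (conductance in units of 2 q mu_p n_i L, charge in units of 2 q n_i L, with L the intrinsic Debye length) and all function names are assumptions of ours and are not taken from the text; the value b = 3 in the example is arbitrary.

```python
import math


def delta_g_normalized(y: float, b: float) -> float:
    """Surface conductance change for intrinsic material (normalized units).

    y is the surface potential in kT/q units and b = mu_n / mu_p.
    """
    excess_electrons = math.exp(y / 2.0) - 1.0   # Delta N / (2 n_i L)
    excess_holes = math.exp(-y / 2.0) - 1.0      # Delta P / (2 n_i L)
    return b * excess_electrons + excess_holes


def plate_charge_normalized(y: float) -> float:
    """Plate charge Q_p = -Q_sc with no surface states (normalized units)."""
    return 2.0 * math.sinh(y / 2.0)


def field_effect_mobility(y: float, b: float) -> float:
    """dG/dQ_p in units of mu_p, from the chain rule (dG/dY) / (dQ_p/dY)."""
    dg_dy = 0.5 * (b * math.exp(y / 2.0) - math.exp(-y / 2.0))
    dq_dy = math.cosh(y / 2.0)
    return dg_dy / dq_dy


def conductance_minimum(b: float, y_lo: float = -10.0, y_hi: float = 10.0,
                        steps: int = 20001) -> float:
    """Locate the surface potential of minimum conductance by a grid scan."""
    ys = [y_lo + (y_hi - y_lo) * i / (steps - 1) for i in range(steps)]
    return min(ys, key=lambda y: delta_g_normalized(y, b))


if __name__ == "__main__":
    b = 3.0  # arbitrary illustrative mobility ratio
    print("grid minimum at Y =", conductance_minimum(b))
    print("-ln b             =", -math.log(b))
    print("mobility, strong n-type bending:", field_effect_mobility(20.0, b))
    print("mobility, strong p-type bending:", field_effect_mobility(-20.0, b))
```

Under these assumptions the field effect mobility passes through zero at the conductance minimum and tends to $\mu_n$ or $-\mu_p$ for strong bending in either direction, the sign merely reflecting which carrier a positive plate charge induces.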


2.1.2 Field Effect with Surface States 


In the presence of surface states, part of the charge on the field plate 
will terminate on charges in surface states. If one now changes the charge 
by some amount, both the semiconductor space charge and charges 
in the surface states will change. The resulting change in surface conduc- 
tance, however, will always be smaller than one which would be obtained 
in the absence of the surface states. The experimentally obtained curve 
of $\Delta G$ versus $Q_p$ is generally wider than that calculated with no surface
states. Experimentally,'*!’ one varies $Q_p$ at some frequency and observes
on an oscilloscope the variation of $\Delta G$ versus $Q_p$. Only states with
relaxation times shorter than the period of the frequency used will be effective.
Slower states will only contribute a fixed charge. 

The point of minimum surface conductance must be obtained in order
to provide the reference point for the quantitative evaluation of the
results."* It has been pointed out in the preceding section that the
minimum conductance corresponds to a unique value of the surface potential
Y for a given resistivity. Furthermore, the conductance in general is a
unique function of Y for each resistivity. Therefore, one shifts the
experimental curve of ΔG versus plate charge relative to the theoretical
space-charge curve along the ΔG-axis until the two minima are matched, as
shown in Fig. 5. It must be noted that their lateral relative position is
arbitrary. Now one transfers the points of various values of Y on the
theoretical curve horizontally to the experimental curve, thus providing the
value of surface potential at each point on the curve. Furthermore, the
horizontal segments between the two curves at each value of Y represent
essentially the net charge in surface states Q_ss (difference between total
charge or plate charge and charge in space charge) with an unknown
additive constant. From plots of dQ_ss/dY versus Y, one may then fit
the results with an approximate distribution of surface states. Rigorously
speaking, an exact distribution is only obtainable if the experiment is
carried out throughout the range of Y = ±∞.26
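
The graphical procedure described above can be sketched numerically. The following is only a minimal illustration, assuming that the experimental and theoretical curves have already been matched at their minima and sampled at the same values of Y. The function names and the sample numbers are hypothetical and are not taken from the measurements reported here.

```python
def surface_state_charge(total_charge, space_charge):
    """Horizontal segment between the two curves at each Y:
    total (plate) charge minus space charge, known only up to an
    additive constant."""
    return [qt - qsc for qt, qsc in zip(total_charge, space_charge)]


def derivative(y, f):
    """Central-difference estimate of df/dy at the interior points."""
    return [(f[i + 1] - f[i - 1]) / (y[i + 1] - y[i - 1])
            for i in range(1, len(y) - 1)]


# Hypothetical samples in arbitrary charge units.
Y = [-2.0, -1.0, 0.0, 1.0, 2.0]
total_charge = [5.0, 3.0, 2.0, 4.0, 7.0]
space_charge = [4.0, 2.5, 2.0, 3.0, 5.0]

q_ss = surface_state_charge(total_charge, space_charge)
dq_ss_dY = derivative(Y, q_ss)  # to be fitted with a trial distribution of states
```

Because Q_ss is known only up to an additive constant, the derivative dQ_ss/dY, which is unaffected by that constant, is the quantity compared with a trial distribution.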


2.1.3 Effect of Slow States 


If one applies a step voltage to the plate, one will first get a corre- 
sponding change in surface conductance resulting from a change in sur- 
face potential, in conformity with the fast state occupancy requirements 
discussed above. If no other states are present, the new value of surface 
conductance would be maintained indefinitely. However, if slow states 
are present, their occupancy, which is determined by their location with 
respect to the Fermi level, must gradually change to correspond to the 
new equilibrium condition. Since the total charge in the surface states
and the space charge must remain constant (equal and opposite to the
plate charge), a change in occupancy of the slow states will cause a gradual
redistribution of charge in the space-charge region and in the fast
surface states. It is obvious that this will always correspond to a decay in
the initial change in conductance. If the density of the slow states
exceeds the plate charge density, the decay is almost complete, with a time
constant of the order of the relaxation time of the slow states. This is
evidence of electronic exchange between the semiconductor and the slow or
outer states. The same phenomenon is also observed in three different
ways in ac field effect measurements. First, if one superimposes a dc bias
on the ac field signal, one observes an immediate shift of the field effect
curve towards p or n, depending on the polarity of the dc bias. This is
followed by a slow decay towards the initial curve. Release of the dc
bias will simply reverse the effect, with a final return to the initial
condition. Second, by changing the ambient, one may observe a shift
towards p or n, depending on the ambient.’ This is an indication of a
change in the density of the slow states and in the magnitude of the fixed
charge they represent. It is a further indication of the ionization of
neutral adsorbed impurities by electronic exchange with the semiconductor
through the surface film. Third, if the frequency of the measurement is
reduced to the range of frequencies of the slow states, one observes
smaller changes in conductance corresponding to very low field effect
mobilities.

Fig. 5 Determination of charges in surface states from field effect experiments

Fig. 6 — Circuit diagram of the zero-frequency four-point probe technique


2.2 Zero-Frequency Four-Point Probe Field Effect Technique 


In many cases, particularly for device purposes, one requires a
measurement of all the states, fast and slow. This may be done by the ac
method, by repeating the measurements at various frequencies. Another
alternative is the zero-frequency* four-point probe technique, shown in
Fig. 6. The outer electrodes are used to feed in the current and the
voltage drop is measured across the inner electrodes. The voltage drop across
the specimen is initially balanced against a reference resistor in series
with the specimen. The change in conductance ΔG produced by a field
is obtained from the voltage difference ΔV between the specimen and
the reference resistor:

$$\Delta G = \frac{\Delta V\,(L/w)}{I_0 R_s^2} \quad \text{mhos}/\square, \qquad (2)$$

where L and w are the length and width of the specimen, I_0 is the current
and R_s is the sample resistance. The plate charge is obtained from
the voltage applied and the measured capacity between the plate and the
specimen.

* The term "zero-frequency" is used to distinguish between this method and
the conventional dc field effect method used to observe conductivity decay.
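
As a worked illustration of the conductance calculation, the sketch below assumes that (2) has the form ΔG = ΔV (L/w) / (I_0 R_s^2), which is the form obtained by differentiating the sheet conductance (L/w)/R_s at constant current. The function name and the numerical values are hypothetical.

```python
def delta_sheet_conductance(delta_v, length, width, current, sample_resistance):
    """Change in conductance per square (mhos/square), assuming
    delta_G = delta_V * (L / w) / (I_0 * R_s**2)."""
    return delta_v * (length / width) / (current * sample_resistance ** 2)


# Hypothetical example: 1 mV unbalance, L/w = 2, 1 mA through a 1000-ohm sample.
delta_g = delta_sheet_conductance(
    delta_v=1e-3, length=2.0, width=1.0, current=1e-3, sample_resistance=1e3
)
```

With these assumed numbers the change is about 2 micromhos per square.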


2.3 Characterization of a Surface Treatment 


A complete characterization of a surface is a detailed description of 
the type, density and distribution of all the surface states. This allows 
one to predict the surface potential obtained for any resistivity material 
as a result of these same surface states. In many cases, however, partic- 
ularly in device applications, it may suffice to know whether a certain 
surface treatment will or will not produce surface inversion. To extra- 
polate field effect results obtained on a material of some resistivity to 
other resistivities, one must show that the surface treatment and the 
resulting fast surface states are not dependent on concentration and
type of body doping.

Fig. 7(a) shows a field effect curve produced at zero frequency, which
hence includes all the states. Point 0 corresponds to zero charge on the
field plate. Point 1 corresponds to an intrinsic surface, or Y = u_b, where
u_b = ln λ and λ = p_0/n_i, with p_0 the hole density in the body and n_i
the intrinsic density. Fig. 7(b) corresponds to the "onset of surface
inversion". To change the surface potential from its initial value Y_0 to
the onset of inversion Y_1 one must apply a charge −Q_{0-1} to the plate.
Invoking the condition of charge neutrality at point 1,

$$[Q_{ss}]_1 = Q_{0-1} - [Q_{sc}]_1, \qquad (3)$$

where Q_{0-1} is measured and [Q_sc]_1 is calculated for the particular
resistivity of the specimen used. This gives [N_ss]_1 = [Q_ss]_1/q, the
absolute number of charges in all the surface states when the Fermi level
coincides with the intrinsic level at the surface, corresponding to the
onset of inversion.

Fig. 7 Determination of type and strength of surface treatment

Fig. 8 — Net surface charge required for surface inversion.
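
The charge-neutrality step can be illustrated with a short calculation. This is a sketch only, assuming the relation [Q_ss]_1 = Q_{0-1} − [Q_sc]_1 followed by division by the electronic charge. The function name and the input values are hypothetical.

```python
import math

ELECTRON_CHARGE_C = 1.602e-19  # coulombs


def surface_state_count(q_plate_0_to_1, q_space_charge_1):
    """Number of charges per cm^2 in surface states at the onset of
    inversion, from charge neutrality. Inputs are in coulombs per cm^2."""
    q_surface_states = q_plate_0_to_1 - q_space_charge_1
    return q_surface_states / ELECTRON_CHARGE_C


# Hypothetical inputs: measured plate charge and calculated space charge.
n_ss_1 = surface_state_count(q_plate_0_to_1=3.204e-8, q_space_charge_1=1.602e-8)
```

For these assumed inputs the result is about 1e11 charges per cm^2.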

Now, for any doping, the number of charges in the space charge at 
the onset of inversion is given by”? 


(N,. )y nil Oe Se 1) +- 


Hence, for any resistivity material, the surface states will just bring about
the onset of inversion if [N_ss]_1, obtained experimentally, is equal to
(N_sc)_1 in (4). For higher resistivity, inversion will occur and, for lower
resistivity, inversion will not occur. Fig. 8 is a plot of (4) for both p- and
n-type silicon.

The procedure we have used for characterizing the effect of a surface 
treatment is simply as follows: 

i. Determine [N_ss]_1 from the field effect data as outlined above. If
this number is positive, it indicates that inversion will occur on n-type
material.

ii. From Fig. 8 determine p_0 or n_0 corresponding to [N_ss]_1. This
determines the limiting resistivity below which inversion will not occur; all
materials with resistivities above this value will form inversion layers.

Now we will introduce some terminology which concisely describes the
properties of an oxidation treatment as related to inversion: p-type oxides
or n-type oxides. This indicates the tendency of the oxidation process
to induce a p-type surface or an n-type surface, respectively, or to induce
inversion in n-type material or in p-type material, respectively. The
strength of an oxide is defined as the limiting resistivity above which
inversion will occur and below which no inversion will occur. To illustrate,
let the oxide be characterized as n-type oxide of strength 10 ohm-cm.
This means that this oxide produces sufficient surface states to invert
the surface of all p-type material of resistivity above 10 ohm-cm.
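
The terminology of oxide type and strength amounts to a simple decision rule, sketched below. The function and argument names are hypothetical and serve only to restate the definition.

```python
def forms_inversion_layer(oxide_type, strength_ohm_cm,
                          material_type, resistivity_ohm_cm):
    """Apply the definition of oxide type and strength.

    An n-type oxide tends to invert p-type material and a p-type oxide
    tends to invert n-type material; inversion occurs only for
    resistivities above the oxide strength."""
    inverted_material = {"n": "p", "p": "n"}[oxide_type]
    if material_type != inverted_material:
        return False
    return resistivity_ohm_cm > strength_ohm_cm


# The example from the text: an n-type oxide of strength 10 ohm-cm.
high_rho_p = forms_inversion_layer("n", 10.0, "p", 50.0)
low_rho_p = forms_inversion_layer("n", 10.0, "p", 5.0)
high_rho_n = forms_inversion_layer("n", 10.0, "n", 50.0)
```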


III. THE OXIDATION PROCESS

3.1 Introductory Remarks


In the search for a coating for stabilization of silicon surfaces, several 
possibilities were considered. These included two general categories: 
coatings that are deposited by methods such as dipping, painting or 
evaporation followed by various processes such as baking, and coatings 
that are grown from the silicon surface itself. The first category generally 
suffers from such difficulties as: (a) ionic impurities (remembering that
1/10,000th of a monolayer is sufficient to invert the surface of 1 ohm-cm
silicon) which will drift with field, producing device instability; (b) lack
of uniformity of chemical bonding (if there is any bonding) at the
silicon-film interface, particularly with an ill-defined initial silicon
surface; (c) poor structures of the coatings from the standpoint of
permeability to various ambients and (d) various incompatibilities such as dielectric
strength or coefficients of expansion. 

For coatings grown out of the silicon surface, we have restricted our 
efforts to silicon dioxide. We considered two alternatives: oxides grown 
anodically in electrolytes and thermally grown oxides. Measurements of 
their transverse conduction indicated substantial differences. The
resistivity of the anodic oxide is of the order of 10" ohm-cm, while that of
the thermal oxide (grown at about 1000°C) is of the order of 10^16 ohm-cm.
Furthermore, in the anodic oxides, part of the conduction is due to mobile
ionic impurities of the order of 10° per cm’, while the ionic content of
the thermal oxide is less than 10" ions per cm’.


3.2 Preoxidation Treatment of Silicon Surfaces 


Preliminary studies, using various silicon diodes, indicated a strong 
dependence of reverse characteristics on the surface treatment preceding 
the oxidation process. Impurities left on the surface can be partially
maintained at the Si-SiO2 interface through the oxidation process, and
may affect the interface characteristics and the device characteristics.
To illustrate, by purposely varying the preoxidation treatment on the
same junctions, it was possible to obtain a range of reverse current of
10°!’ to 10° amperes. This work provided an extensive empirical
background on the effect of preoxidation treatment and indicated the
desirability of obtaining as clean a Si-SiO2 interface as possible. Accordingly,
all the details of the procedure prior to and during the oxidation
process were chosen to achieve this objective. A suitable preoxidation process
must be capable of: (a) removing surface imperfections, damage, etc.;
(b) removing the bulk of organic surface residues (weakly bound); (c)
removing chemically bound organic substances; (d) removing metallic
impurities and (e) providing in a controlled fashion a lightly oxidized
surface prior to thermal oxidation.


3.3 Monitoring the Preoxidation Process 


In exploring the effects of preoxidation treatments, it appeared de- 
sirable to have some simple monitoring technique to indicate qualita- 
tively certain effects of the various treatments on some property of the 
surface. A technique was used which determined whether the surface was
hydrophobic or hydrophilic. The technique is based on the water break
and water spray tests, whereby the contact angle of water droplets
determines whether the surface is wettable (hydrophilic) or not
(hydrophobic). The surface to be examined is dipped in liquid nitrogen for
about 10 seconds, then mounted under a high-power microscope (400×)
in a closed chamber where wet nitrogen is circulated. One first observes
a thin sheet of ice forming uniformly on the surface. After the ice melts,
a distinctive pattern of water droplets forms which remains for a few
minutes before final evaporation. It has been observed that the shape
and size of the droplets and the uniformity of the pattern are sensitive to
the surface treatment. This process may be repeated (without re-treating
the surface) about five to ten times before observable changes in the 
pattern start to occur, indicating surface deterioration. 

The following is a summary of our observations: (i) A silicon surface
that has been freshly etched in HNO3-HF mixtures is strongly
hydrophobic. A typical pattern of this surface as obtained by the "liquid
nitrogen test" is shown in Fig. 9(a). The droplet size is of the order of
0.0001 inch. (ii) After the etched surface has been boiled in deionized
water for as long as one hour, the surface remains hydrophobic. The size
of the droplets, however, may slightly increase. (iii) After having been
boiled in organic solvents the surface remains hydrophobic, although the
shape and size of the droplets may change [Fig. 9(b)]. (iv) Boiling in
water containing detergents does not produce a hydrophilic surface. (v)
When the surface is heated in hot oxidizing agents such as HNO3 for
10 minutes or more the surface becomes uniformly hydrophilic. Fig. 9(c)
shows the optical interference pattern of such a surface just before
complete evaporation of the once continuous film of water. (vi) After having
been boiled in water this hydrophilic surface remains hydrophilic. (vii)
If, however, the hydrophilic surface is exposed for a few seconds to HF
vapor, it reverts to a strongly hydrophobic surface. (viii) If the
hydrophilic surface obtained by the HNO3 treatment is exposed to room air,
it becomes gradually and nonuniformly hydrophobic. Boiling the exposed
surface in water or organic solvents will not restore the surface to its
former hydrophilic state. (ix) A thermally oxidized surface is
hydrophilic, and shows behavior similar to that of the nonoxidized surface
treated with HNO3 when exposed to room air. A major difference, however,
is that its original hydrophilic state can be largely restored simply by
rinsing in an organic solvent. This suggests the relative inertness of an
oxidized surface compared to a freshly etched surface.

Fig. 9 — Silicon surface examination by the "liquid nitrogen test": (a) after
exposure to HF; (b) after subsequent boiling in xylene; (c) after boiling in nitric
acid.


3.4 The Thermal Oxidation Process and Properties of the Thermal Oxide 


Silicon surfaces were oxidized at temperatures in the vicinity of 1000°C
and at atmospheric pressure in both dry and wet oxygen. The oxidation 
rates in oxygen and in water vapor have been measured* using a vacuum 
microbalance technique. For the range of film thicknesses of about a 
hundred to thousands of angstroms the oxidation process follows the 
following parabolic laws: 


Oy: X® = 84 X 10" plist t0/Kr,
H,O: X? = 2.54 X 10% p/p e174

where k is the Boltzmann constant, T the absolute temperature, X the
film thickness in angstroms, p the pressure in atmospheres, t the time
in minutes and q the electronic charge. It is of particular interest to note
that the activation energies are equal for both O2 and H2O oxidation,
suggesting that the two types of oxides have the same structure. The
high activation energy of 1.7 electron volts is also significant, since it
presumably corresponds to extraction of silicon from the lattice and its
diffusion in the oxide network.
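
The behavior implied by a parabolic law with a 1.7 electron volt activation energy can be illustrated as follows. This is a sketch under stated assumptions: it takes the generic form X^2 = A p^n t exp(−E/kT), with the prefactor A and the pressure exponent n supplied by the caller. The values used in the usage lines are placeholders and are not the constants of the laws above.

```python
import math

BOLTZMANN_EV_PER_K = 8.617e-5  # Boltzmann constant in electron volts per kelvin


def oxide_thickness(prefactor, pressure_atm, pressure_exponent,
                    time_min, temperature_k, activation_ev=1.7):
    """Film thickness from an assumed parabolic law
    X^2 = A * p^n * t * exp(-E / kT); units follow those of the prefactor."""
    x_squared = (prefactor
                 * pressure_atm ** pressure_exponent
                 * time_min
                 * math.exp(-activation_ev / (BOLTZMANN_EV_PER_K * temperature_k)))
    return math.sqrt(x_squared)


# Placeholder prefactor and exponent; only the ratio is meaningful here.
x_10_min = oxide_thickness(1.0, 1.0, 1.0, time_min=10.0, temperature_k=1273.0)
x_40_min = oxide_thickness(1.0, 1.0, 1.0, time_min=40.0, temperature_k=1273.0)
thickness_ratio = x_40_min / x_10_min
```

Whatever the constants, a parabolic law means that quadrupling the oxidation time doubles the film thickness.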

The following are some properties of the thermal oxide: 

i. Electron diffraction studies of thermally grown silicon oxides have 
indicated no crystalline structure. The film is essentially continuous and 
amorphous. 

ii. The dielectric constant of the film is about 4. It has a resistivity
of about 10^16 ohm-cm, measured in a transverse direction to the
oxidized surface. The dielectric strength of the film is between 5 and
10 × 10^6 volt/cm.

iii. If the surface impurities are not carefully removed prior to
oxidation, oxide crystallites will form on the surface that become visible under
the microscope if the film grown is of sufficient thickness. This produces
rather serious discontinuities in the film.

iv. Discontinuities in the film are also observed at some surface
imperfections, due either to the nonuniform growth of the oxide at the
imperfection or to impurities trapped at the imperfection and subsequently
affecting the local oxidation. These local imperfections are made
beautifully visible, even for films a few atomic layers thick, by the "chlorine
etching" technique. At a temperature of about 900°C, chlorine gas will
etch silicon at a rate of the order of 0.001 inch per minute but will not
react with silicon dioxide. To test for film imperfections, the oxidized
specimen is placed in the chlorine oven for a few minutes, allowing the
chlorine to etch through discontinuities in the oxide. Fig. 10 shows a
chlorine-treated oxide surface indicating some imperfections along
well-defined surface damage.

Fig. 10 — Oxide imperfections as revealed by the hot chlorine technique
(magnification 500×)

v. Uniform oxides have been obtained consistently. They showed few 
imperfections as indicated by the chlorine test. Oxidized surfaces stored 
for more than 12 months before exposure to the chlorine test have in- 
dicated the permanence of film uniformity. Furthermore, oxidized de- 
vices given the 900°C chlorine test indicated no change in their electrical 
characteristics. 


IV. FIELD EFFECT MEASUREMENTS ON THERMALLY OXIDIZED SILICON: 


Both the large-signal ac field effect technique, using a frequency of 40
to 500 cps, and the zero-frequency four-point probe field effect technique
were used. The sample resistivities ranged from 80 to several hundred
ohm-cm, both for p-type and n-type silicon. Pulled crystals, rotated
and nonrotated, as well as floating-zone crystals were used. 


4.1 Evidence for the Absence of Slow States


Fig. 11(a) shows the ac field effect pattern obtained at 70 cps for a
300-ohm-cm p-type pulled crystal with an oxide thickness of about 500
angstroms. The field-plate signal was +280 volts. For an etched silicon
surface, if one superimposed a dc bias on the ac signal, the pattern would
shift to one side or the other, then drift back within about a minute to
the initial pattern. For the oxidized silicon, however, an application of
−90 dc volts produced the shift indicated in Fig. 11(b) and +90 volts
produced the shift indicated in Fig. 11(c), all figures having the same
calibrations. If one superimposes all three curves, one finds that the
actual measured shift of the curves corresponds closely to the 90 volts
which have been applied. This indicates that the usual slow states observed
on unoxidized surfaces have been virtually eliminated. Furthermore, the
90-volt dc bias was left on for two hours with no change in the field
effect pattern, which is shown in Fig. 11(d) at the end of two hours. To
determine the possible existence of slower states, we have also maintained
a dc field in a four-point probe experiment (discussed in a later section)
for over 3000 hours with no measurable decay in surface conductance.
To explore the possibility of states with relaxation times corresponding
to higher frequencies than the 70 cps used above, a frequency run was
performed. This was done on a 300-ohm-cm p-type rotated crystal with
an oxide thickness of 180 angstroms. The field effect pattern was
observed at 11 different frequencies from 40 to 500 cps. Fig. 12 shows the
patterns obtained at 40, 200 and 500 cps, all drawn to the same scales.
Due to distortions at the higher frequencies, lower plate voltages had to
be applied at 200 and 500 cps. Here again, if one superimposes all three
patterns by making the midpoints coincide, one obtains one identical
curve. This shows that no slow states (or not-so-slow states) are
observable up to 500 cps. Unfortunately, our measuring set is limited to 500
cps and, accordingly, we have no knowledge of the lower limit of the
relaxation time of the surface states.

Fig. 11 Large-signal ac field effect: effect of a superimposed dc bias

Fig. 12 Large-signal ac field effect: effect of frequency


4.2 Effect of Ambients


The experiments reported in the above section were performed in dry
oxygen. To study the effects of ambients, the following gases have been
circulated: dry oxygen, dry nitrogen, wet oxygen and wet nitrogen (~40
per cent relative humidity), ammonia and ozone. No shifts in the field
effect pattern were observed over several hours. On replacing the
oxidized sample with etched silicon and germanium samples, shifts in the
field effect patterns were observed in accordance with the usual
experience.


4.3 Effect of Presence of External Impurities During Oxidation:
Not-So-Fast States


This effect was first observed accidentally when the oxidation tube
was cracked and impurities from the heating elements of the oven were
present during the oxidation process. The general effect is to produce
"not-so-fast states," as indicated by both the zero-frequency technique
and the ac technique. The zero-frequency technique indicates a very
small change in conductance with applied field. The ac field effect
technique, however, by simple changes of the frequency, shows the effects of
the not-so-fast states. Fig. 13 shows the results obtained from a sample
showing this effect. From bottom to top the frequencies are 25, 50, 75,
100, 150, 200, 250 and 300 cps. As shown, the tendency is for the
surface to become more p-type with increased frequency, indicating that
the responsible states are mainly donor type (neutral or positive). It is
of further interest to note that most of the change in the pattern
occurred between 50 and 300 cps, setting a range of relaxation times for
these states between 3 and 20 milliseconds.

Fig. 13 Large-signal ac field effect: effect of frequency when not-so-fast
states are present

Fig. 14 — Zero-frequency four-point probe field effect: etched silicon surface.
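
The quoted range of relaxation times follows from taking the period of the measuring frequency as the relaxation time at which a state just responds, as stated earlier for the ac method. A minimal arithmetic check, with a hypothetical function name:

```python
def period_ms(frequency_cps):
    """Period of the measuring signal in milliseconds."""
    return 1000.0 / frequency_cps


longest_ms = period_ms(50.0)    # period at 50 cps
shortest_ms = period_ms(300.0)  # period at 300 cps
```

The periods at 50 and 300 cps are 20 milliseconds and about 3.3 milliseconds, in agreement with the range given.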


4.4 Zero-Frequency Field Effect Measurements on Oxidized Floating-Zone
Silicon


First we will demonstrate the typical behavior of etched silicon
surfaces when they are examined by the zero-frequency four-point probe
technique. Fig. 14 shows a typical decay with time of surface conductance
for a p-type silicon specimen of 180 ohm-cm produced by the
floating-zone technique.* The change in conductance decays to practically
zero in a little over one minute. The plate voltage was −200 volts,
corresponding to a plate electronic charge density of 7 × 10^10/cm^2. On
removal of the plate voltage, the reverse effect is obtained.

* These samples had 22 zone passes in a hydrogen atmosphere. It is believed
that most fast-diffusing impurities, as well as oxygen, have been removed.

Samples of the same crystal were oxidized to various oxide thicknesses.
The field effect results were quite reproducible for each sample and from
sample to sample. In all cases, no decay in the change in conductance
was observed. Fig. 15 shows the results from one unit, plotted as the
change in surface conductance versus plate charge. The calculated
space-charge plot is also shown, from which the values of surface potential
were obtained. They are shown on the experimental curve. Note that,
with this technique, the lateral location of the theoretical space-charge curve
is absolute. The horizontal segments between the theoretical and the
experimental curves represent the total net charge in surface states (and
also fixed charges if present, which is not the case, as will be shown
shortly from data on the effect of time). The point of intersection
corresponds to zero net charge in the surface states; to the right, there is a
net positive charge and, to the left, a net negative charge. From this, it is
concluded that the surface states present are of two types, donor and
acceptor. By analyzing the data in the usual way, the following
approximate distribution of states was obtained:

i. An acceptor level located at about 0.4 electron volt below the
intrinsic level, of density of the order of 10"/cm^2.

ii. A donor level located higher than 0.3 electron volt above the
intrinsic level, of density of the order of 10^10/cm^2.

iii. A nonuniform and less significant distribution near the center of
the band, of average density of the order of 10°/cm^2 volt.

For this high-purity floating-zone material, the general tendency after
oxidation is to produce a p-type surface, indicating the predominance of the
acceptor-type surface states. In terms of oxide type and strength, discussed
previously, these results indicate a p-type oxide of strength of about 3
ohm-cm; i.e., the same surface states would invert all n-type silicon
surfaces with resistivities above 3 ohm-cm.

Fig. 15 — Zero-frequency four-point probe field effect: thermally oxidized
high-purity p-type crystal.

To determine whether there is any significant number of very slow 
states within or on the surface of the oxide, a sample was given a life test 
in which, after its field effect characteristics were measured, a dc bias of
100 volts was left on (corresponding to a plate electronic charge of about
1.5 × 10"/cm^2). Periodically, the surface conductance was remeasured
and the field effect curve was re-obtained. The test has so far exceeded
3000 hours with no detectable change in the conductance or the field
effect curve. Since the film dielectric constant and resistivity are 4 and
10^16 ohm-cm, respectively, its relaxation time κρ/4π (in electrostatic
units) is of the order of one hour. One concludes, therefore, that the
number of outer or very slow states is insignificantly small.
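
The order of magnitude of the film relaxation time can be checked with a short calculation. The sketch below uses the practical-unit equivalent of κρ/4π, namely the vacuum permittivity times κρ, and takes the resistivity as 10^16 ohm-cm, which is the reading assumed here. The function name is hypothetical.

```python
VACUUM_PERMITTIVITY_F_PER_CM = 8.854e-14  # farads per centimeter


def dielectric_relaxation_time_s(dielectric_constant, resistivity_ohm_cm):
    """Dielectric relaxation time in seconds; equivalent to
    kappa * rho / (4 * pi) evaluated in electrostatic units."""
    return VACUUM_PERMITTIVITY_F_PER_CM * dielectric_constant * resistivity_ohm_cm


tau_s = dielectric_relaxation_time_s(4.0, 1e16)
tau_hours = tau_s / 3600.0
```

This gives roughly 3500 seconds, or about one hour, so a 3000-hour test spans some thousands of relaxation times.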


4.5 Surface Properties of Oxidized Silicon Pulled Crystals; Effect of Body
Impurities


Field effect experiments have also been performed on more than 20
samples from a number of high-resistivity pulled crystals, nonrotated
and rotated at different rates during growth. They generally exhibited,
when oxidized, the same general behavior as far as the absence of slow
states, high field effect mobility, and insensitivity to ambients are
concerned. One very significant difference that was observed, however, was
that the surfaces obtained were generally n-type; i.e., the oxides were
n-type and had a wide range of strength from 0.4 to 20 ohm-cm. This
indicates that, in contrast to high-purity floating-zone crystals, the
donor-type surface states are more predominant than the acceptor-type
states. The variations in film strength were observed both from crystal to
crystal and from sample to sample in the same crystal. In many cases, the
surface was so strongly n-type that no conductivity minimum was observed
even when a plate signal corresponding to about 2 × 10" electron charges
per cm^2 was applied, indicating a density of donor-type states in excess
of this figure. To show that the n-type surface was induced by charges in
surface states rather than by some body doping in a small region near the
surface, the oxides were removed by exposure to HF vapor for 30 seconds,
followed by a water rinse. Thermal probing of the surface indicated p-type
material, and ac field effect observations indicated the disappearance of
the initial n-type surface.


In view of the above results on the high-purity crystal, it was further
concluded that a strong n-type surface, as is obtained with pulled crystals,
is not a characteristic behavior of thermally grown silicon oxide. To
explain this behavior of pulled crystals, the main remaining possibility is
crystal impurities, particularly the fast diffusants. This possibility has
been supported by two main observations:

i. Pulled n-type crystals have shown, in the majority of cases, rather
large increases in body resistivities during the oxidation process (as high
as two orders of magnitude). These were observed when the samples
were quenched from the oxidation temperature to room temperature.
This change, however, will decay at room temperature, and the decay is
nearly complete in the order of an hour. At 200°C the decay is complete
in less than 10 minutes. This phenomenon is not understood, but it is
presumably due to some body impurities producing donor-type levels,?"
possibly through formation of complexes with oxygen in the crystal or
precipitation of acceptors.*° In our work with high-purity floating-zone
crystals, on the other hand, these changes in body resistivity were either 
absent or comparatively small, the largest observed increase being about 
15 per cent. Therefore, it seems quite plausible that the source of the 
high density of donor-type surface states observed with pulled crystals 
is fast-diffusing body impurities which are gettered by the oxide. 

ii. The second observation was a result of the following experiment. 
A 10,000-angstrom gold film was evaporated on the same high-purity 
floating-zone sample for which results were given in Fig. 15. The sample 
was then heated in an argon oven for 30 minutes at 950°C. The excess
gold was then removed and the sample re-etched. From the increase in
resistivity it was estimated that approximately 3 × 10" atoms/cm^3 of
gold were added. This is based on published data®! on the donor and
acceptor levels introduced by gold in silicon (donor level 0.35 electron
volt from valence band; acceptor level 0.54 electron volt from conduction
band), and on the assumption that each gold atom is electrically active.
The sample was oxidized (300 angstroms) and the zero-frequency field
effect measurement was repeated. The results are shown in Fig. 16. One
now finds that the surface is n-type, the conductivity minimum is barely
observed and the field effect mobility has decreased markedly to about
80 cm^2/volt-sec. The oxide was then removed by HF vapor, followed by
a water rinse, and the surface reverted to p-type. From this, one
concludes that added body impurities such as gold (and possibly others that
were unknowingly introduced into the crystal along with the gold) can
diffuse to the surface and produce donor-type surface states that will
induce a strong n-type surface. Finally, to determine the effectiveness of
the oxide in gettering body impurities, the same sample was re-oxidized
for 65 hours at 920°C (4000 angstroms), the oxide was removed and a
one-half-hour oxide was regrown (300 angstroms). Field effect
measurements were made and are presented in Fig. 17. Comparing this with
Figs. 15 and 16, one finds the resulting surface still n-type, yet the field
effect mobility is slightly higher and the conductance minimum is more
pronounced. From body resistivity measurements, one calculates a
decrease in gold content from the initial concentration of 3 × 10! to
1 × 10" atoms/cm^3.

Fig. 16 — Zero-frequency four-point probe field effect: effect of adding gold
to crystal


4.6 Comment on the Nature of Surface States at an Oxidized Surface


It was pointed out in the introduction that surface states exist on an
atomically clean surface. These are associated with the dangling bonds
or free orbitals. These "Tamm" states are acceptor-type states which
are neutral when unoccupied and negatively charged when occupied. It
has further been shown, theoretically, that, at the interface between
two dissimilar crystals, Tamm-type states will also exist. In the case of
a thermally grown oxide on a silicon surface, one may visualize a gradual
transition in structure (within a few atomic layers) from an
all-crystalline structure to an all-amorphous structure. The composition must also
undergo a gradual transition from all silicon to all silicon dioxide. One
intuitively expects, therefore, that such a transition should give rise to
some Tamm-type surface states, as well as states associated with
vacancies due to lattice mismatch. For the surface states observed in our
experiments with thermally oxidized surfaces, we propose the following
tentative model. The acceptor-type states observed are Tamm-type
states and vacancy states, while the donor-type states observed are
associated with impurities that may be introduced in a variety of ways.
From this result the concepts of surface doping and surface-state
compensation, in exact analogy to their counterparts in the body of a
semiconductor. For further examination of this model, it appears quite
important to study the surface states on an atomically clean silicon surface
which has been thermally oxidized in pure oxygen. This study is
currently in progress.

Fig. 17 Zero-frequency four-point probe field effect: gold gettering by
prolonged oxidation


V. CHARACTERISTICS OF THERMALLY OXIDIZED DEVICES 


5.1 Characteristics of Oxidized Diodes 


The oxidation process has been applied to several thousand diffused
junctions. These included n⁺-p and p⁺-n diodes (graded junctions) in
the range of breakdown voltages of 20 to 400 volts, n-p-n and p-n-p
structures and p-n-p-n switching transistor structures. Crystals used
were all pulled crystals, with phosphorus and boron the doping impurities.

The following results were all obtained on junctions with thin oxide 
films, 150 to 300 angstroms, prepared at 920°C in dry oxygen (10 to 30 
minutes oxidation time). After oxidation the junctions were quenched in 
room air. No special precautions were taken for their storage; they were 
usually stored in plastic boxes in room air. Units stored in this fashion 
for as long as 15 months have shown no change in electrical character- 
istics. Most of the measurements given below were obtained in room air, 
and contacts were made by a point pressed on the top surface of the 
device. By applying a sufficiently high voltage it is possible to break 
through the oxide film (only for the thin films) and obtain satisfactory 
contact for reverse bias current measurements.

Fig. 18 shows the reverse V-I characteristics for two different
boron-doped crystals, A and B. The junctions were obtained by diffusing
phosphorus into a mechanically polished surface, giving a junction which
is about 0.001 inch deep. The structure is essentially n⁺-p. Each slice
had 12 individual diodes 0.025 × 0.025 inch, and slices A and B had body
breakdown voltages of 36.5 and 48 volts, respectively. The circles
indicate the average current for each voltage; the range is also
indicated. It is seen that the spread is within a factor of 2 to 3, which
has been typical for all our results within one slice and within various
slices with the same history. From crystal to crystal, however, and
sometimes from one diffusion run to another, the reverse currents may
vary by one to two orders of magnitude. This is illustrated in Fig. 18.
This figure also indicates another significant effect that has been
consistently observed with n⁺-p diodes. For junctions giving higher
reverse currents, there is less dependence of current on voltage. In
general, we have observed that for n⁺-p graded junctions the reverse
current is proportional to the applied voltage raised to a power which
varies from nearly zero to one-third. Crystal A in Fig. 18 gave a power
of 0.11 to 0.18 (in the range 1 to 10 volts). Fig. 19 shows the same
effects for two other sets of n⁺-p diodes made from two different
crystals, with higher body breakdown voltage as indicated. For set C the
power dependence varied from 0.14 to 0.28.

From measurements of lifetime by the injection-extraction technique,"
it was shown that the body saturation current (due to diffusion according
to the simple diode theory*®) is negligible. The contribution of body
current to the observed currents must, therefore, be due to space-charge
generation.*® 7.5 For graded junctions as used above, however, the
space-charge generation current is proportional to the applied voltage
(plus a built-in voltage) to the one-third power. Occasionally, this
dependence has been observed with oxidized units, but, in most cases,
the dependence observed is weaker for n⁺-p junctions, as discussed
above. This strongly suggests a separate contribution of the surface to
the observed currents.

Fig. 18. Typical characteristics of oxidized n⁺-p graded junction silicon
diodes.

Fig. 19. Typical characteristics of oxidized n⁺-p graded junction silicon
diodes.

Fig. 20. Separation of body and surface currents for oxidized diodes.


To illustrate this, the following experiment was performed. On one
slice of diffused n⁺-p material, 22 rectangular or square units of
different sizes were etched and oxidized to a thickness of about 200
angstroms. The range of perimeter-to-area ratio obtained was 40 to 200
inch⁻¹. This is based on the nominal dimensions rather than the actual
dimensions (which include perimeter irregularities resulting from
etching), and this accounts for some of the spread of the data shown in
Fig. 20. This figure is a plot of I/A versus P/A, where I is the reverse
current measured at 10 volts, A is the nominal area of the diode and P is
its nominal perimeter. If the currents observed were only body currents,
this plot would be a horizontal line, indicating that the current density
is constant, regardless of shape or size of the diode. One observes,
however, that the experimental points have a definite trend towards an
increase in I/A with an increase in the P/A ratio. The median line is
shown, together with two other lines that contain all the points except
one. From the intercepts and slopes, one finds that the currents observed
can be split into body and surface components. From Fig. 20, the body
component is 5.3 × 10⁻⁸ amperes/cm² and the surface component is
3.5 × 10-° ± 40 per cent amperes/cm, both measured at 10 volts (for a
40-volt breakdown voltage n⁺-p diode) and at 25°C.
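
The separation into body and surface components can be illustrated
numerically. The following Python sketch assumes the simple model
I/A = J_body + K_surface (P/A) and fits it by ordinary least squares. The
text draws a median line through the points, so the least-squares fit is
a substitute technique; the function name and all numerical values are
hypothetical illustrations and are not the data of Fig. 20.

```python
def fit_body_surface(p_over_a, i_over_a):
    """Least-squares line i_over_a = j_body + k_surface * p_over_a.

    Returns (j_body, k_surface): the intercept is the body current
    density, the slope is the surface current per unit perimeter.
    """
    n = len(p_over_a)
    mean_x = sum(p_over_a) / n
    mean_y = sum(i_over_a) / n
    sxx = sum((x - mean_x) ** 2 for x in p_over_a)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(p_over_a, i_over_a))
    k_surface = sxy / sxx
    j_body = mean_y - k_surface * mean_x
    return j_body, k_surface


# Hypothetical, noise-free values chosen only to exercise the fit.
J_BODY = 5.0e-8     # amperes/cm^2 (assumed)
K_SURFACE = 3.0e-9  # amperes/cm (assumed)
ratios = [16.0, 30.0, 45.0, 60.0, 79.0]  # P/A in cm^-1, roughly 40 to 200 inch^-1
densities = [J_BODY + K_SURFACE * r for r in ratios]

j_fit, k_fit = fit_body_surface(ratios, densities)
print(j_fit, k_fit)
```

With real measurements the points scatter about the line, and the
uncertainty in the slope then sets the tolerance on the surface
component.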
For further identification of the mechanisms of the body and surface
currents, one must repeat the above experiments at various temperatures
and examine the activation energy in different temperature ranges.
Preliminary results indicate that, between room temperature and 250°C,
the activation energy increases with temperature, approaching
approximately 1 electron volt near 160°C and in some cases being as low
as 0.55 electron volt at room temperature. Tentatively, this suggests
that reverse currents near room temperature are due to space-charge
generation by traps located approximately at the middle of the energy
gap, and that, at higher temperatures, the usual saturation currents due
to diffusion will predominate (activation energy equal to the energy gap).
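
As a rough illustration of how an activation energy is extracted from
such measurements, the sketch below assumes that the reverse current
follows I ∝ exp(−E/kT) between two temperatures. The function name and
the sample currents are hypothetical; only the 0.55 electron volt figure
is taken from the text.

```python
import math

K_BOLTZMANN_EV = 8.617e-5  # Boltzmann constant, electron volts per kelvin


def activation_energy_ev(i_1, t_1_kelvin, i_2, t_2_kelvin):
    """Activation energy E, assuming I is proportional to exp(-E / kT)."""
    return (K_BOLTZMANN_EV * math.log(i_2 / i_1)
            / (1.0 / t_1_kelvin - 1.0 / t_2_kelvin))


# Hypothetical currents generated from an assumed 0.55 eV activation energy.
E_ASSUMED = 0.55
t_1, t_2 = 298.0, 323.0
i_1 = math.exp(-E_ASSUMED / (K_BOLTZMANN_EV * t_1))
i_2 = math.exp(-E_ASSUMED / (K_BOLTZMANN_EV * t_2))

print(activation_energy_ev(i_1, t_1, i_2, t_2))
```

Applied to successive pairs of temperatures, the same calculation would
show the activation energy rising as diffusion current takes over from
space-charge generation.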

Now, returning to Figs. 18 and 19, one must explain why the dependence
of current on voltage is to a power less than one-third. This may be
explained in terms of our results from field effect measurements on pulled
crystals, as reported in previous sections. It was pointed out that, for
pulled crystals, the general tendency is to form n-type oxides, i.e., oxides
with a predominance of donor-type states which may invert the surface
of a p-type crystal. This means that a “channel” can form on the p-side
of an n⁺-p diode. The surface component of the observed current,
therefore, corresponds to this channel current. Now, if sufficiently high
voltage is applied, the channel will “pinch off” and its current will
essentially saturate. This current, added to the body current which
varies as $V^{1/3}$, gives the total current, which will appear to have a
weaker dependence on voltage. The stronger the channel formed, the higher
its pinch-off current will be and the weaker the voltage dependence of
the total current above pinch-off voltage will be. For the results of
Figs. 18 and 19, this suggests that the pinch-off voltages were below one
volt, the lowest voltage used in these experiments. More recent work on
n-p-n structures, whereby similar channels were formed by oxidation and
the channel characteristics obtained both before and after pinch-off,
substantiates the above explanation.
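
The weakening of the apparent power law can be seen in a short numerical
sketch. It assumes a body current proportional to the one-third power of
the applied voltage (the built-in voltage is neglected) plus a constant,
fully saturated channel current. The function name, the unit body
coefficient and the channel currents are hypothetical illustration
values.

```python
import math


def apparent_exponent(channel_current, v_low=1.0, v_high=10.0, body_coeff=1.0):
    """Slope of log(I) against log(V) between v_low and v_high.

    Total current = constant channel current + body_coeff * V**(1/3).
    """
    def total(v):
        return channel_current + body_coeff * v ** (1.0 / 3.0)

    return math.log(total(v_high) / total(v_low)) / math.log(v_high / v_low)


# A stronger channel (larger saturated current) gives a weaker dependence.
for i_channel in (0.0, 1.0, 5.0):
    print(i_channel, apparent_exponent(i_channel))
```

With no channel current the exponent is one-third; as the saturated
channel current grows relative to the body current, the apparent exponent
between 1 and 10 volts falls toward zero, which is the trend reported for
the n⁺-p diodes.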

The above reasoning suggests that one should obtain a markedly
different behavior for p⁺-n structures. This is, indeed, the case, and
Figs. 21 and 22 show typical reverse characteristics of two different
samples, E and F, of p⁺-n diodes. The junction depth is about 0.001 inch,
and the body breakdown voltages are 64 and 82.5 volts for samples E and F,
respectively. Sample E consists of 12 diodes and sample F consists of 7
diodes. In the range of 1 to 10 volts, the current is proportional to the
voltage raised to a power of 0.4 to 0.64, which is in excess of the
one-third power expected for a graded junction on the basis of
space-charge generation. This, again, is due to surface contribution,
which is explainable in the same way we have explained the behavior of
n⁺-p junctions. The oxide is n-type, which tends to form an enhancement
or enrichment layer on the n-region of the p⁺-n junction. If this
enhancement is not sufficiently strong, one obtains softer reverse
characteristics, as shown in Figs. 21 and 22. It is believed that this
soft characteristic is made up of a succession of small localized surface
breakdowns which occur at various voltages below body breakdown. We have
seen cases where, instead of a gradual softness, the reverse
characteristic is made up of a succession of distinct segments starting
at some low voltage and proceeding until body breakdown is reached. The
slopes of these lines correspond to resistances of the order of several
thousand ohms. This suggests localized surface breakdowns. These effects
are now being studied for shallow junctions (of the order of a micron),
using the light emission technique.**? Preliminary results indicate that
the occurrence of each of these segments (or knees on the V-I curve)
corresponds exactly to the onset of light emission (white light from one
point on the junction surface).


5.2 Characteristics of Oxidized p-n-p-n Diodes 


Another device that was oxidized was the two-terminal p-n-p-n structure
transistor switch, which is an all-diffused diode with junction spacings
of the order of 0.0001 inch. Fig. 23 shows typical characteristics of a
set of 20 units processed on one slice, including the high impedance
characteristics in both directions. (All these units had a 300-angstrom
oxide.) The circles represent the average readings and the corresponding
spreads are indicated.

Fig. 22. Typical characteristics of oxidized p⁺-n graded junction silicon
diodes.

In this device, as well as in various others, it is important to maintain
the lifetime in the oxidation process. For a switching diode, the
lifetime will be mainly reflected in the turn-on current of the device.
This requirement is compatible with the oxidation process, provided that
the cooling after oxidation is controlled. It was possible to maintain
the lifetime, or even show some improvement, by cooling the oxidation
oven at a rate of a few degrees per minute. This was checked on both
diodes, using the injection-extraction technique for lifetime
measurement, and on the p-n-p-n switch, as reflected in its turn-on
current.


VI. NOISE IN OXIDIZED DIODES 


Fig. 23. Typical characteristics of oxidized p-n-p-n silicon switching
diodes.

Excess noise in single-crystal filaments*:” and diodes* usually exhibits
a $1/f^{\alpha}$ spectrum ($\alpha \approx 1$) and is sensitive to surface
conditions. Several theories# 4445.46 have been proposed to explain this
frequency dependence. McWhorter,® for instance, suggests that the 1/f
noise is caused primarily by fluctuations in the occupancy of slow
surface states. From field effect measurements as a function of frequency
he has deduced a distribution of the relaxation times for the slow states
on germanium surfaces, and he has shown that the same distribution can
account for the 1/f dependence of excess noise. For single-crystal
filaments, the fluctuating occupancy of slow states produces conductivity
modulation in the bulk. In diodes and transistors, it also modulates the
recombination rate at the surface, because of the relation between
surface recombination velocity and surface potential.”
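
The way a spread of relaxation times can produce a 1/f spectrum may be
sketched numerically. The code below assumes that each slow state
contributes a Lorentzian spectrum τ/(1 + (2πfτ)²) and that the relaxation
times τ are distributed in proportion to 1/τ between two limits. That
distribution, the limits and the function names are assumptions made for
illustration; they are not taken from the text.

```python
import math


def superposed_spectrum(freq_hz, tau_min=1e-6, tau_max=1e2, steps_per_decade=20):
    """Sum of Lorentzians over relaxation times spaced evenly in log(tau).

    Even spacing in log(tau) corresponds to a density proportional to 1/tau.
    """
    decades = math.log10(tau_max / tau_min)
    n = int(round(decades * steps_per_decade))
    d_ln_tau = math.log(tau_max / tau_min) / n
    total = 0.0
    for k in range(n + 1):
        tau = tau_min * math.exp(k * d_ln_tau)
        total += tau / (1.0 + (2.0 * math.pi * freq_hz * tau) ** 2) * d_ln_tau
    return total


def log_log_slope(f_1, f_2):
    """Average slope of log(spectrum) against log(frequency)."""
    ratio = superposed_spectrum(f_2) / superposed_spectrum(f_1)
    return math.log(ratio) / math.log(f_2 / f_1)


print(log_log_slope(1.0, 1000.0))
```

For frequencies well inside the range set by the two limiting relaxation
times the slope comes out close to −1, that is, a 1/f spectrum.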

As discussed previously, thermally oxidized silicon surfaces show no
effects of slow states, so that it is of interest to determine whether the
1/f noise also is absent. Preliminary measurements of excess noise in
oxidized silicon diodes have shown the following:

i. Reverse-biased diodes operated at 75°C in dry air show no noise in
excess of shot noise to frequencies as low as 30 cps (Fig. 24).

ii. Removal of the oxide in HF vapor generally does not introduce any
additional noise.

iii. After a light etching in an HF-HNO₃ solution a $1/f^{0.65}$
dependence of excess noise was found up to a frequency of about 10 kc
(Fig. 25).

Fig. 24. Noise in a thermally oxidized silicon diode.

Fig. 25. Noise in silicon diode after removal of oxide and light etching.


VII. SUMMARY 


The problem of the stabilization of silicon surfaces has been
re-examined for the case in which the silicon surface is chemically bound
to an appropriate solid film that is well defined in composition and
structure. We have studied one such system, namely, the silicon-silicon
dioxide system when the oxide film was produced by high temperature
oxidation of the silicon surfaces.

i. Film properties. The films produced were amorphous, continuous
and uniform. Various thicknesses were studied from a few hundred
angstroms to tens of thousands of angstroms. The films had a resistivity
of about 10!* ohm-cm, dielectric constant of about 4, and dielectric
strength of about 10⁷ volts/cm. The concentration of impurities which
were detected as mobile ionic charges was less than 10'/cm³.

ii. Interface properties. A completely new kind of surface was produced
by thermal oxidation, with its own characteristic distribution of surface
states. These were fast or interface states; slow or outer states were not
observed. The 1/f noise commonly associated with fluctuations in slow
states was also not observed down to a frequency of 30 cps. The fast
states consisted of both donor-type and acceptor-type states. Their
densities and distributions, which determine the surface potential
(surface inversion or accumulation), were affected by both the
pre-oxidation treatment and the concentration of certain impurities in
the silicon crystal. Furthermore, the resulting surface potential was
virtually locked and showed no sensitivity to wet ambients, ammonia vapor
or ozone.

iii. Compatibility. We have demonstrated the applicability of the
oxidation process to certain device structures. We must stress, however,
that it cannot be considered as a universal process which can be applied
to any arbitrary device structure and maintain some desired surface
properties. The oxidation process provides not only a stable surface but
a completely new kind of surface with its own characteristic properties.
In applying the process to devices, it cannot be considered simply as a
stabilization of surface properties but must, instead, be considered as
an integral part of the device structure. From this evolves the important
concept of “surface design,” which should be incorporated in the early
stages of device design.


Analysis and Design of a Transistor
Blocking Oscillator Including
Inherent Nonlinearities


By J. A. NARUD and M. R. AARON

(Manuscript received July 17, 1958)

An analysis of transistor blocking oscillators is presented which differs
significantly from previous approaches in that the nonlinear dependence of
collector current and base voltage upon base current is considered. First,
the nonlinear differential equations governing circuit performance are
derived considering the effects of alpha cutoff, collector capacitance and
leakage and magnetizing inductance. The relative importance of the various
terms in this equation during transition, relaxation and recovery
intervals is discussed from a physical viewpoint.

Analog and digital computer solutions of the nonlinear differential
equation yield pulse responses which give excellent agreement with
experimental results for various signals, transistor characteristics and
values of passive circuit elements. The paper concludes with a design
example in which the analysis is confirmed by experiment.


I. INTRODUCTION 


Transistor blocking oscillators are finding application in digital
computers¹ and transmission systems² where it is necessary to reconstruct
a pulse train that has been dispersed in a noisy medium. In fact, several
papers³,⁴,⁵ have been published concerning the design and analysis of
such circuits. All of the previous analytical approaches have been based
on the use of a piecewise linear analysis. Linear circuit approximants
have been used which are approximately valid during portions of the
circuit operation, and formulas have been derived for the rise time and
pulse width of the output pulse. The piecewise linear approach does not
deal with the inherent nonlinear character of the circuit, and the
approximations involved in linearization are often difficult to justify.
The most serious shortcoming of the linearization technique is the fact
that it does not give information concerning trigger requirements for
reliable operation.

In view of the inherent restrictions of the approximate analysis, a
more detailed analysis which deals directly with the nonlinear nature of
the circuit has been undertaken. The following sections of this paper will
be concerned with the development of this procedure. First, the
equivalent circuit, including the nonlinear dependencies, will be
displayed and the assumptions on which the analysis is based will be
discussed. The nonlinear integro-differential equations governing the
performance of the circuit throughout its entire cycle of operation will
be derived. A clear qualitative picture of the circuit operation is
achieved with the nonlinearities included. Physical reasoning permits the
dissection of the nonlinear equations into equations governing the
circuit operation during the transition intervals (“off” and “on”), the
“on” interval and the “recovery” interval. Trigger requirements for
reliable regenerative action are derived. These are, of course, dependent
on the nonlinear characteristics of the transistor as well as on other
circuit parameters. Requirements on the circuit parameters to minimize
ringing, achieve fast rise time, and recover quickly for repetitive
operation are developed. Application of the results of the analysis to a
specific example is given and confirmed by experiment.

Both analog and digital computers have been used as adjuncts to the
strictly analytic approach to search out relationships between circuit
parameters to achieve desired performance. It is believed that the
combined analytical and computer approach used in this paper is more
general and more easily applied than the piecewise linear analysis and,
furthermore, that it is a valuable design philosophy for exploring other
nonlinear circuits. Further work on other nonlinear transistor circuits
using this approach is being pursued.


II. THE EQUIVALENT CIRCUIT


In selecting the equivalent circuit for an active device one is usually
confronted with two conflicting requirements. On the one hand, the
equivalent circuit must accurately exhibit the same properties as the
physical device, so that computations based on the equivalent circuit
give substantial agreement with experiment. On the other hand, there
is a twofold requirement for simplicity. First, the number of parameters
involved should be minimized to make the analysis tractable. Second,
it should be possible to draw qualitative conclusions regarding the circuit
operations from the equivalent circuit. In the case of the transistor
blocking oscillator, these requirements are particularly important. Since
the oscillator is highly nonlinear in its operation the number of coupled
equations involved is closely related to the complexity of the equivalent
circuit.

With these factors in mind, the equivalent circuit shown in Fig. 1 (a) 
was chosen to represent the transistor in the grounded emitter blocking 
oscillator. It takes into account the following properties of the transistor: 
(a) the nonlinear relationship between base voltage and base current; 
(b) the nonlinear dependence of collector current on base current and 
frequency; (c) partially, the effect of minority carrier storage during 
“turnoff” and (d) the combined effect of collector and stray capacitance. 

However, the circuit neglects, among other things, the effect of
coupling between the base and the collector circuit, emitter junction
capacitance, base spread resistance and recombination. In the following
we will discuss the quantities listed above, and how they can be obtained
from measurements on the transistor. The surface barrier transistor 2N128
will be used throughout this paper to exemplify the procedure; the
results are applicable to other types of junction transistors with
broadly similar characteristics. Modifications in the analysis to
accommodate drastic differences from this model can be made, with an
attendant increase in the complexity of the analysis.


2.1 Nonlinear Relationship Between Base Voltage and Base Current


In the equivalent circuit of Fig. 1(a) the nonlinear relationship
between base voltage and current is denoted by

$$v_b = v_b(i_b). \tag{1}$$

The dependence on collector current has been neglected since, as shown
later, the terms involving $v_b$ in the equations governing the blocking
oscillator are negligible during the “turn-on” and “on” intervals. Thus,
$v_b$ may be obtained by measuring the base voltage as a function of base
current, with the collector circuit connected to the load and bias with
which it will normally operate. The result of such a measurement is
shown in Fig. 1(b). It is seen that this result is almost a straight line
above a certain value of $i_b$ (saturation) and almost vertically straight
when $i_b < 0$. Thus, it can sometimes be approximated by two
straight-line segments through the origin with different slopes.” Another
way of approximating it is to use the relationship between current and
voltage of p-n junction diodes:

$$v_b = \frac{kT}{q} \ln\left(1 + \frac{i_b}{I_s}\right), \tag{2}$$

where $kT/q = 0.026$ volt at room temperature (25°C) and $I_s$ is the
reverse saturation current.
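
The p-n junction approximation can be tried out with a short Python
sketch. It assumes the ideal diode law in the form
v_b = (kT/q) ln(1 + i_b/I_s), with kT/q = 0.026 volt as quoted in the
text. The function names and the saturation current used in the example
are hypothetical.

```python
import math

KT_OVER_Q = 0.026  # volt at room temperature, as quoted in the text


def base_voltage(i_b, i_s):
    """Base voltage from base current under the ideal diode law (assumed form)."""
    if i_b <= -i_s:
        raise ValueError("base current must exceed -I_s for the logarithm to exist")
    return KT_OVER_Q * math.log1p(i_b / i_s)


def base_current(v_b, i_s):
    """Inverse relation: i_b = I_s * (exp(v_b / (kT/q)) - 1)."""
    return i_s * math.expm1(v_b / KT_OVER_Q)


I_S = 1.0e-6  # amperes, hypothetical reverse saturation current
for i_b in (0.2e-3, 1.0e-3, 10.0e-3):
    print(i_b, base_voltage(i_b, I_S))
```

The logarithmic form reproduces the slow growth of base voltage at large
forward current and the steep, nearly vertical branch for small negative
current.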
2.2 Nonlinear Dependence of Collector Current on Base Current and Frequency


The dependence of the collector current upon base current and
frequency is assumed in the equivalent circuit to be of the form

$$\left(1 + \frac{1}{\omega_0}\frac{d}{dt}\right) i_c = I_c(i_b), \tag{3}$$

where $I_c(i_b)$ represents the static collector-current-base-current
characteristic, $i_c$ the actual collector current, and $\omega_0$ the
“large-signal” cutoff frequency or inverse time constant of the
transistor. This expression is the “large-signal” equivalent of the
linear relationship


$$\left(1 + \frac{1+\beta}{\omega_\alpha}\frac{d}{dt}\right) i_c = \beta\, i_b, \qquad \beta = \frac{dI_c}{di_b}. \tag{4}$$

The derivative of (3) with respect to $i_b$ reduces to (4) when the circuit
operation is linear. The function $I_c(i_b)$ may be determined
experimentally by measuring the collector direct current for various
values of base current, with the collector connected to the load and bias
with which it will normally operate. Such a characteristic is shown in
Fig. 1(c). It is seen that $I_c(i_b)$ saturates very sharply, the
saturation value being approximately $V_{cc}/R_L$. In some cases in the
subsequent analysis it will be convenient to approximate this function
with an analytic expression.

Fig. 1. (a) The grounded emitter configuration and its large-signal
equivalent circuit; (b) $v_b$ vs. $i_b$; (c) $I_c$ vs. $i_b$.

From (4) it is seen that, when the circuit operation is linear, the
effective cutoff frequency, $\omega_0$, for the grounded emitter
configuration is

$$\omega_0 = \frac{\omega_\alpha}{1+\beta}, \tag{5}$$

and therefore is dependent upon $\omega_\alpha$ and $\beta$. Thus, as the
operating point is changed, $\omega_0$ will vary, having a minimum when
$\beta$ (or $\alpha$) is a maximum and becoming equal to $\omega_\alpha$
when $\beta = 0$ as, for example, at cutoff or saturation. When a large
signal is applied to the transistor, $\omega_0$ will, therefore, change
continuously as the various points of the $i_c/i_b$ characteristics are
traversed and the “effective cutoff frequency” will then be some kind
of average of (5). Although $\omega_\alpha$ is also dependent on the
location of the operating point, it has been found that the
$\omega_\alpha$ determined at standard bias ($V_{cc}$) yields transient
results which agree closely with experiment. These results are shown in
Table I and discussed further below.

Moll has suggested the following calculation of the large-signal cutoff
frequency, $\omega_0$: If an average $\bar{\beta}$ is defined as the ratio
of the total excursions of the collector and base currents; that is,

$$\bar{\beta} = \frac{\Delta i_c}{\Delta i_b}, \tag{6}$$

then the large-signal cutoff frequency is given by:

$$\omega_0 = \frac{\omega_\alpha}{1 + \bar{\beta}}. \tag{7}$$

Accordingly, in a circuit like the blocking oscillator, in which the
operation extends from cutoff to saturation, $\omega_0$ will be given by
(neglecting

The large-signal cutoff frequency may also be determined
experimentally by applying a current step function of magnitude $I_{bs}$
to the base and then measuring the 10 to 90 per cent rise time of the
voltage across the load resistance. Since the output response can be
described by a single exponential, provided the load resistance, $R_L$,
is selected to be so small that the effect of the collector capacitance
may be neglected, $\omega_0$ is given by:

$$\omega_0 = \frac{2.2}{T_r}, \tag{9}$$

where $T_r$ is the 10 to 90 per cent rise time. In Fig. 2(a) the output
response is shown for steps of various magnitudes in base current. It is
seen that the rise time continually decreases as the size of the step
increases, and that the response is very nearly exponential in shape,
thereby justifying the above approximation.
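
The factor relating rise time to cutoff frequency can be checked
numerically. The sketch below assumes the single-exponential model
(1 + (1/ω₀) d/dt) i_c = I_c with a step in I_c, integrates it by forward
Euler steps, and measures the 10 to 90 per cent rise time. The function
names, the step size and the value of ω₀ are hypothetical choices made
for illustration.

```python
import math


def simulate_rise_time(omega_0, final_current=1.0, steps_per_time_constant=10000):
    """10 to 90 per cent rise time of the step response, by forward Euler."""
    dt = 1.0 / (omega_0 * steps_per_time_constant)
    i_c, t = 0.0, 0.0
    crossings = {}
    while 0.9 not in crossings:
        # One Euler step of di_c/dt = omega_0 * (I_c - i_c).
        i_next = i_c + dt * omega_0 * (final_current - i_c)
        for level in (0.1, 0.9):
            target = level * final_current
            if i_c < target <= i_next:
                # Linear interpolation inside the step that crosses the level.
                crossings[level] = t + dt * (target - i_c) / (i_next - i_c)
        i_c, t = i_next, t + dt
    return crossings[0.9] - crossings[0.1]


def cutoff_from_rise_time(rise_time):
    """Cutoff frequency from the rise time, using 2.2 as the rounded value of ln 9."""
    return 2.2 / rise_time


OMEGA_0 = 1.0e8  # radians per second, hypothetical
t_r = simulate_rise_time(OMEGA_0)
print(t_r * OMEGA_0, math.log(9.0), cutoff_from_rise_time(t_r))
```

The product of rise time and ω₀ comes out near ln 9, which rounds to the
factor 2.2, so the cutoff frequency recovered from the simulated rise
time agrees with the value put in to within a fraction of a per cent.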

With two ways of calculating $\omega_0$, the natural question that arises
is how well these two methods agree. To provide a partial answer to this
question, the calculated and measured rise times of six transistors were
compared. The rise time was computed by use of (8) and (9) and measured
across a 100-ohm load when a current step function of magnitude $I_{bs}$
was applied to the base. The results are shown in Table I, from which
it is seen that the calculated and measured values agree within ±10
per cent and the average values agree within 2 per cent. Taking into
account the error involved in measuring rise times from the face of an
oscilloscope, the agreement is indeed quite good.


2.3 Effect of Minority Carrier Storage During Turnoff 


The effect of minority carrier storage in the collector-base region is not
very easily related to any of the commonly measured transistor parameters.
The most convenient way to take it into account is to determine its
effective time-constant experimentally. Fig. 2(b) shows the voltage
across a 400-ohm load resistor in series with the collector of a junction
surface barrier transistor (2N128) when various square current pulses
are applied to the base. The rise time, fall time and width of the applied
current pulse were 1, 7 and 100 millimicroseconds, respectively, and its
magnitude was varied in steps from 0.2 to 10 ma. The base saturation
current for that particular transistor (Unit 151) was 1 ma. From the


Fig. 2 Collector current in response to (a) a step of base current, $I_b$; (b) a
pulse in base current of magnitude $I_b$, with rise time of 1 mμsec, fall time of
7 mμsec and width of 100 mμsec. Time scales: (a) 25 mμsec/division; (b) first four
curves, 50 mμsec/division; next two curves, 100 mμsec/division; bottom curve,
250 mμsec/division.


figures it is seen that, in contrast to the rising edge of the output pulse,
the falling part becomes slower as the magnitude of the input pulse is
increased. Also, the 10 to 90 per cent fall time is almost constant up to
saturation, after which it increases rapidly. Again, since the falling
response has roughly an exponential shape, the fall time $T_f$ can be used to
estimate the equivalent $\omega_0$ during turnoff by the relationship:

$$\omega_0 \approx \frac{2.2}{T_f}. \qquad (10)$$

Columns 8 and 9 of Table I list the fall times and the corresponding
$\omega_0$'s for the six transistors.


2.4 Combined Effect of Collector and Stray Capacitance 


In the small-signal case the output impedance of the grounded emitter
configuration is:

$$Z_{out} = Z_c(1 - \alpha) + r_e\,\frac{r_b + \alpha Z_c + Z_g}{r_b + r_e + Z_g}, \qquad (11)$$


where $Z_g$ is the impedance of the generator driving it. Since, in the
blocking oscillator, $Z_g$ is large and $r_e$ is very small, the output impedance
consists essentially of a capacitance of magnitude

$$C_c(1 + \beta). \qquad (12)$$

Under large-signal operation, both $\beta$ and $C_c$ will vary over the range,
the collector capacitance varying with the collector voltage as

$$C_c = kV_c^{-r}, \qquad (13)$$

where the exponent $r$ is 1/2 for step junctions and 1/3 for uniformly graded
junctions. Measurements made on a dozen 2N128 transistors yield an
average value for $r$ very close to 1/3.

Bashkow has shown that, as far as transient analysis is concerned,
this nonlinear capacitance may be replaced by an average capacity $C_{av}$.
The average capacity is defined as that which displaces the same charge
as the nonlinear capacity for the same voltage change. A minor generalization
of Bashkow's results yields

$$C_{av} = C_c\,\frac{1 - (V_c/V_{co})^{1-r}}{(1 - r)\,(1 - V_c/V_{co})}, \qquad (14)$$

where $V_{co}$ is the collector voltage at cutoff (at which $C_c$ is evaluated) and $V_c/V_{co}$ is the ratio of the
voltage swing on the collector. For a swing from $V_{co}$ to 0, and $r = 1/3$,
(12) becomes

$$C_0 = 1.5\,C_c(1 + \bar{\beta}), \qquad (15)$$

where, as in the case of $\omega_0$, $\beta$ has been replaced by $\bar{\beta}$. In a physical
circuit, stray capacity from collector to ground should be added to $C_0$.
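
The factor of 1.5 follows from the charge-equivalence definition of the average capacity. The sketch below assumes the power-law capacitance $C = kV^{-r}$ described above and integrates it in closed form; the function names, the constant `k` and the 10-volt swing are hypothetical values chosen only for illustration.

```python
def junction_capacitance(v, k, r):
    """Assumed power-law collector capacitance, C(V) = k * V**(-r)."""
    return k * v ** (-r)

def average_capacitance(v_hi, v_lo, k, r):
    """Capacitance that displaces the same charge over the swing v_lo..v_hi.

    The charge is the integral of C(V) dV, evaluated in closed form (r != 1).
    """
    charge = k * (v_hi ** (1.0 - r) - v_lo ** (1.0 - r)) / (1.0 - r)
    return charge / (v_hi - v_lo)

# Hypothetical swing from 10 volts (cutoff) down to 0 volts.
k, v_cutoff = 1.0, 10.0
for r in (1.0 / 3.0, 0.5):
    ratio = average_capacitance(v_cutoff, 0.0, k, r) / junction_capacitance(v_cutoff, k, r)
    print(f"r = {r:.3f}: C_av / C(V_cutoff) = {ratio:.3f}")
```

For a full swing to zero the ratio reduces to $1/(1 - r)$, which is 1.5 for a graded junction ($r = 1/3$) and 2 for a step junction ($r = 1/2$), independent of `k` and of the cutoff voltage.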
With the transistor characterized in this manner it is readily shown 
that the Laplace transform of the collector current of the resistively 


loaded ($R_c$) common emitter stage is given by

$$I_c(s) = \frac{I_{cs}}{s\,(1 + s/\omega_0)(1 + sR_cC_0)} \qquad (16)$$

for a step function in base current sufficient to saturate the transistor.
To check the validity of the equivalent circuit, the 10 to 90 per cent rise
time was computed for six transistors from the inverse transform of (16)
using (7) and (15).* The rise time was also determined experimentally
and the ratio of observed rise time to computed rise time is given in the 
last column of Table I. It can be seen that agreement within ±15 per
cent has been achieved on individual units, and that the average is within
2 per cent. These figures are within both the experimental error in reading
a scope face and the error inherent in determining $C_c$, $\omega_\alpha$ and $\bar{\beta}$.
In passing, it should be noted that a good approximation to the 10 to
90 per cent rise time is given by

$$T_R \approx \sqrt{2\pi}\,\sqrt{\frac{1}{\omega_0^2} + R_c^2C_0^2}. \qquad (17)$$

The above expression results from (16) by an application of the rise time
definition given by Elmore. This will be mentioned again in the section
devoted to the rise time of the blocking oscillator. It can also be shown
by using Elmore's expression that the delay time (time to traverse from
zero to 50 per cent of the final output) is given by

$$T_D = \frac{1 + \omega_0R_cC_0}{\omega_0}. \qquad (18)$$
Over a wide range of parameters, two times the time delay, as given
above, is a very close approximation to the actual rise time. In this case,
the rise time [two times (18)] is similar in form to that given by Easley
and can be justified by other considerations.† Equation (18) will be
useful in the following sections in indicating how the factors in the design
of the transistor blocking oscillator can be related to the rise time or
delay time of the resistively loaded transistor alone.

* 5 micromicrofarads were added to $C_0$ to account for stray socket and wiring
capacity.

† Several definitions of rise time are given in Ref. 11, one of which, when applied
to the grounded emitter stage, yields (18).
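
Elmore's delay and rise-time definitions can be compared with the 10 to 90 per cent rise time for a simple case. The sketch below assumes, purely for illustration, a two-pole step response with time constants $1/\omega_0$ and $R C_0$ (the product form is an assumption of this sketch); the numerical values and function names are hypothetical.

```python
import math

def step_response(t, tau1, tau2):
    """Unit step response of 1/((1 + s*tau1)(1 + s*tau2)), with tau1 != tau2."""
    return 1.0 - (tau1 * math.exp(-t / tau1) - tau2 * math.exp(-t / tau2)) / (tau1 - tau2)

def crossing_time(level, tau1, tau2):
    """Time at which the (monotonic) step response reaches `level`, by bisection."""
    lo, hi = 0.0, 50.0 * (tau1 + tau2)
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if step_response(mid, tau1, tau2) < level:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

tau1, tau2 = 1.0, 0.5  # hypothetical values of 1/omega0 and R*C0 (same units)

elmore_delay = tau1 + tau2                                    # first moment
elmore_rise = math.sqrt(2.0 * math.pi * (tau1**2 + tau2**2))  # from second central moment
actual_rise = crossing_time(0.9, tau1, tau2) - crossing_time(0.1, tau1, tau2)
actual_delay = crossing_time(0.5, tau1, tau2)

print(f"Elmore delay        : {elmore_delay:.3f}")
print(f"50 per cent delay   : {actual_delay:.3f}")
print(f"Elmore rise time    : {elmore_rise:.3f}")
print(f"twice Elmore delay  : {2.0 * elmore_delay:.3f}")
print(f"10-90 per cent rise : {actual_rise:.3f}")
```

Changing `tau1` and `tau2` shows how closely each estimate tracks the measured-style rise time for a particular ratio of the two time constants.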

Before proceeding to the blocking oscillator, it should be mentioned
that the equivalent circuit of Fig. 1(a) may be used for the grounded
base configuration. In this case, $\omega_0 = \omega_\alpha$ and $C_0 = 1.5C_c$. Therefore,
the following discussion is also applicable to the grounded-base case.


III. THE GROUNDED EMITTER TRANSISTOR BLOCKING OSCILLATOR


Fig. 3 (a) Grounded emitter transistor blocking oscillator; (b) nonlinear
equivalent circuit.

Fig. 3 shows the circuit diagram as well as the equivalent circuit for
the grounded emitter blocking oscillator with series feedback. The primary
of the transformer is connected in series with the collector and the
load resistance, $R_c$, and its secondary is connected between the base and
the bias source $E_{bb}$. The polarity of the coupling is such that positive
feedback exists around the loop when the transistor is conducting. The
current generator $I_t$ represents the trigger source and $R_b$ the total
external load in the base circuit.* As discussed in the previous section, $\omega_0$
represents the large-signal cutoff frequency of the transistor and is
related to $\omega_\alpha$ through (8). The capacitance $C_0$ is the sum of the total
stray capacity of the circuit and that of the transistor given by (15).
Finally, $L_c$, $L_m$ and $C_T$ represent, respectively, the leakage and magnetizing
inductances and the parasitic capacity of the transformer.

Briefly, the circuit operation is as follows. For monostable operation
the bias $E_{bb}$ is selected in such a way that the transistor is cut off at its
stable operating point. If a trigger signal of sufficient magnitude and
width to bring the circuit into its regenerative region is applied, the
collector and base currents will build up until a point in the saturation
region is reached. (This point will hereafter be referred to as the
"quasistable operating point.") Then the magnetizing inductance becomes
charged slowly, shorting out more and more of the current fed back to
the base. This process will go on until the gain around the loop again
becomes equal to unity, at which point the circuit regenerates back into
a state of cutoff. Then the energy stored in the transformer slowly
dissipates and the circuit recovers to its stable operating point.

The equation governing this complete cycle of operation is derived in
Appendix A and normalized in Appendix B.

One point is immediately obvious from these equations: they are ex- 
tremely long, complicated and relatively useless as they stand. In order 
to restore the physical feel for the circuit, it will be shown in the re- 
mainder of the paper that the equations can be simplified, depending on 
the region of operation. The primary basis for reducing the more exact 
equations will be whether or not the base emitter junction is forward- or 
back-biased. In principle, this approach is similar to that used in piece- 
wise linear analysis. There are several important advantages, however, 
in starting with the more exact description of the circuit given here. First, 
retention of the nonlinear terms enables one to determine the operating 
points of the circuit, which, in turn, give a clear definition of the trigger 
requirements for reliable operation. Second, the approximations involved 
in taking the general equations apart can be easily drawn. Finally, the 
simulation of the blocking oscillator from the exact equations on an
analog or digital computer can be readily achieved for purposes of complete
analysis or evaluation of the approximations that are made in tearing
the more complete description apart. The computer approach also
permits the investigation of the effects of the various parameters of the
circuit on its response under conditions that can be more closely
controlled than can be those in the experimental circuit.

* For convenience, $R_b$ has been placed between base and $E_{bb}$ rather than from
base to ground. This approximation serves to eliminate a few small constant terms
from the governing equation in Appendix A, and in no way restricts the analysis.

The equations can be broken down in accordance with the importance
of the various terms during the following four intervals: (a) "Transition
On", (b) "On", (c) "Transition Off" and (d) "Recovery." These will
be discussed separately in the following sections.*


IV. THE "TRANSITION ON" INTERVAL


This interval is defined as the time from the initiation of the “On” 
trigger pulse to the time at which the circuit has apparently come to 
rest at its quasistable operating point, that is, the time when all transients 


associated with switching have ceased. Specifically,


I, " lo < I, ° (19) 


By the time triggering occurs, the input impedance to the transistor is
low compared to $R_b$ and

(20)


where $[V_b(i_b) - E_{bb}]/n$ represents the voltage drop that must be maintained
across the primary of the feedback transformer to overcome the
bias and forward drop. This voltage is unavailable in series with the
load and should be made as small as possible, implying the use of a
transistor with a low forward drop. Fig. 1(b) shows that this voltage is
essentially constant during the on interval; therefore very little feedback
current will be diverted by the transformer capacity $C_T$. Subject to
condition (20) and the constancy of $[V_b(i_b) - E_{bb}]/n$, this term and
derivatives thereof may be neglected. During the switching interval, the
contribution from the integral


R,, € En 


“0 R, 


will be negligible, since the magnetizing inductance is usually large in

* The equations in Appendix A can be readily modified to deal with the blocking
oscillator with shunt feedback. For example, $R_c = 0$ and the load resistance is
reflected through the transformer in parallel with $R_b$.

comparison with the other parameters of the circuit.* Finally, the magnetizing
current is equal to $nI_{bb}$. Therefore, in this interval, (109) of
Appendix A simplifies to


L.Co diy R.« 0 di, | di . 
- LA. : ee _— 1) 
wy dé y ( \ wo ) t (* ee Ut 


d R.CNE LCod 
+ RC» - nC = [,(t) 
alias F ( . a \s a ‘| 


The equation governing the equilibrium points (or the "steady-state
equation," as it has been called) for this interval is obtained by setting
$I_t(t)$ and all the derivatives of $i_b$ equal to zero; that is, we obtain

I (ts) = 1, + Be = 0, 


where 


[(i,) I Nt) [ + Ry ( + Rs) Volt) 
n n Lam wy Ry, 


represents the nonlinear or active part of this equation. In this expression,
the term $I_c(i_b)/n$ represents the contribution to $I(i_b)$ by the collector
current and the term $[V_b(i_b) - E_{bb}]/R_b$, as noted, is the amount of
current flowing down through the resistance $R_b$ to overcome the bias.


The term 
R, | V,(%) By 
T R . ) 
nom (| R, 


represents the modification in $I(i_b)$ due to finite magnetizing inductance,
or the equivalent drain of this inductance. In other words, the magnetizing
inductance may, as far as the steady-state equation is concerned,
be thought of as modifying the resistance $R_b$ to:


Re, 


(L+H) 


It is instructive to study the graphical solution of the steady state 


Rh, 


R, 
id OP 


equation, since this reveals the locations of the operating points and
gives a qualitative idea of the magnitude of the trigger signal needed to
trigger the circuit. In Fig. 4 the various components of $I(i_b)$ and the
graphical solution of the steady-state equation are shown. For the sake
of clarity the contribution due to the term containing $V_b(i_b)$ has been
somewhat exaggerated.

* The principal effects of neglecting this term are: (a) the trigger current
required for reliable triggering will be somewhat larger than indicated in the
following for slow trigger signals; (b) the rise time will also increase slightly above that
computed in the following, due again to diversion of feedback current through $L_m$.

In this figure the long-dashed curve represents the dynamic $I_c(i_b)/n$
characteristic for the transistor and the short-dashed curve the contribution
due to the $[V_b(i_b) - E_{bb}]/R_b$ term. The solid curve constitutes the
combined effect of these two curves, while the three intersections with
the straight line $i_b - I_{bb}$ represent the three operating (or equilibrium)
points of the blocking oscillator. Since the loop gain of the circuit

$$\frac{dI(i_b)}{di_b}$$

is obviously less than unity at the two extreme points (s and q.s.), these
are operating points of the stable kind. In the following, these two points
will be referred to as the "stable" and the "quasistable" operating points,
respectively. The circuit cannot stay for an indefinite period of time at
the q.s. point, due to the accumulation of current in the magnetizing
inductance. Finally, the point U is unstable (saddle-point) since the
loop gain here is larger than unity, giving rise to responses in the
neighborhood of this point directed away from it.

Fig. 4 Graphical solution of "steady-state equation".

In general, the location of the operating points will depend upon $I_{bb}$,
the shape and magnitude of $I(i_b)$ and the turns ratio $n$. However, since
the contribution to $I(i_b)$ of the term containing $V_b(i_b)$ is very small when
$i_b > 0$ and the backward input resistance $dV_b(i_b)/di_b$ is very large, the
bias current $I_{bb}$ or the bias voltage $E_{bb}$ will not significantly affect the
location of any of the operating points, provided the magnetizing inductance
is relatively large. In addition, this means that the stable operating
point is more or less fixed to a position just to the left of the origin on the
$i_b$ axis. Hence, only the locations of the unstable and the quasistable
operating points are subject to change; such changes can only be effected
by varying $I_c(i_b)$ and $n$. The magnitudes or saturating values,
$I_{cs}$, of $I_c(i_b)$ and $n$ affect the location of these two points in an
opposite manner. For example, as $I_{cs}$ increases or $n$ decreases, the unstable
point moves closer to the stable one and the quasistable operating
point is displaced further out, provided the value of $i_b$ at which $I_c(i_b)$
saturates remains unchanged. Finally, if $I_c(i_b)$ is made steeper close to
the origin, this will also result in a smaller distance between the unstable
and the stable operating points.

On the other hand, if the magnetizing inductance, or $n$, is made very
small such that

$$\left[\frac{R_b}{n^2L_m}\left(\frac{1}{\omega_0} + R_cC_0\right)\right]\frac{V_b(i_b) - E_{bb}}{R_b} \qquad (26)$$

is significant with respect to $I_c(i_b)/n$, the term containing $V_b(i_b)$ will
also affect the location of the unstable and the quasistable operating
points. For instance, by reducing $L_m$ (but letting $n$ remain fixed) the
$I(i_b)$ curve is moved downward, causing the unstable operating point to
move further away and the quasistable one to move closer to the stable
operating point, thereby reducing the range that $i_b$ and $I(i_b)$ traverse in
going from one operating point to the other.

The dependence upon $n$ is more complicated, however, since it affects
both the saturation value of $I(i_b)$ and $n^2L_m$. If $n$ is very small, $n^2L_m$
will also be small, tending to make the distance between the stable and
unstable points large. On the other hand, a small $n$ will also increase the
saturation value of $I(i_b)$, thereby tending to decrease the distance between
these two points. However, since $n^2L_m$ is proportional to the square
of $n$, while the saturation value of $I(i_b)$ varies as $1/n$, the net effect at first will be that the unstable
operating point will move closer to the stable one when $n$ is increased
from a small initial value. Then, as $n$ becomes larger, the effect of the
$V_b(i_b)$ term becomes negligible and $I_c(i_b)/n$ will predominate, with the
result that the distance between these two operating points will increase.
Hence, as a function of $n$, this distance will go through a minimum.
Therefore, in order to make sure that the magnetizing inductance does
not affect the location of the operating points and does not become
significantly charged during the "Transition On" interval, $L_m$ and $n$ should
be selected well beyond this minimum; that is,


I \ lt 
V, {2,) | Dee 
hy 


In such cases, it is an excellent approximation to neglect the effect of the
term containing $V_b(i_b)$ entirely, and use only $I_c(i_b)/n$ for $I(i_b)$. The
graphical determination of the operating points then simplifies to that of
Fig. 5. It is seen that $n$ is now simply equal to the ratio $I_{cs}/I_{bs}$,
which, by virtue of (8), means that $\omega_0$ may be expressed in terms of $\omega_\alpha$
and $n$ only; that is,

$$\omega_0 = \frac{\omega_\alpha}{1 + n}. \qquad (28)$$

In the following, the requirements for triggering the blocking oscillator,
the effect of trigger magnitude, the shape of the collector current
characteristics and the coefficients of the differential equation upon rise time
and delay and the conditions for a nonoscillatory response will all be
discussed. It will be assumed that condition (27) is satisfied, so that the
effect of the magnetizing inductance may be neglected. Also, for most of
the following, $I_c(i_b)$ will be approximated by an analytic function.

Fig. 5 Approximate location of operating points.

These items are most easily discussed if (21) is written in normalized
form. From Appendix B, it can be shown that the normalized equation
governing the "Transition On" interval is

$$a\frac{d^3x}{d\tau^3} + (a + b)\frac{d^2x}{d\tau^2} + (1 + b)\frac{dx}{d\tau} + x - f(x) = \left[1 + (1 + b)\frac{d}{d\tau} + (a + b)\frac{d^2}{d\tau^2} + a\frac{d^3}{d\tau^3}\right]g(\tau), \qquad (29)$$

where all symbols are defined in Appendix B. The normalized steady-state
equation is now

$$f(x) - x = 0, \qquad (30)$$

where both $x$ and $f(x)$ attain the values 0 and 1 at the stable and the
quasistable operating points, respectively. Finally, the normalized output
voltage for this interval may be written

$$y(\tau) = x(\tau) - g(\tau). \qquad (31)$$

V. TRIGGER SIGNAL REQUIREMENTS 


If the blocking oscillator is released from an initial position that is
different from any of its operating points, it will proceed, depending upon
the initial conditions, either to the stable or the quasistable operating
point. For instance, if the initial values of all the derivatives in (29) are
equal to zero, the circuit will simply come to rest at the operating point
that is on the same side of the unstable point as is the initial position.
In general, however, the circuit may cross the unstable operating point,
provided the initial values of some of the derivatives in (29) are large
enough. Hence, in order to make the blocking oscillator flip over from
the stable to the quasistable operating point, the applied trigger signal
must have sufficient magnitude and duration either to bring the circuit
past its unstable operating point or to impart energy into it so that the
initial values of the derivatives in (29) are large enough to make the
circuit traverse the unstable point by itself. In this section the conditions
this requirement imposes upon the trigger signal will be discussed.

The simplest possible case is that of a trigger signal with a very large
width. In such cases, the trigger pulse will vary very slowly or be nearly
constant during the transition interval, and the derivatives of $g(\tau)$ in
(29) may therefore be neglected. Up to the point of triggering, the base
current will also vary very slowly and its derivatives will consequently
be small.* Therefore, prior to the time regenerative action takes place,
(29) reduces to:

$$x - f(x) = g(\tau). \qquad (32)$$


Under these conditions, the trigger signal acts mainly as a variable bias
causing the load line in Fig. 4 or Fig. 5 to be displaced to the right.
This, in turn, makes the stable and the unstable operating points move
closer and closer together until they merge into one point. At this point
(J) the load line is tangential to $f(x)$ [or $I(i_b)$] and an infinitesimal
additional displacement of the load line will cause the circuit to have only
one possible operating point. Hence, a "jump" to the quasistable operating
point will take place at this point, and the minimum required trigger
magnitude is given by:

$$G_{min} = x_J - f(x_J), \qquad (33)$$

where $x_J$ represents the value the normalized base current has at the point
of tangency, J; that is, $x_J$ is found by solving the equation:

$$\frac{df(x_J)}{dx} = 1. \qquad (34)$$

In terms of actual trigger current, this minimum corresponds to the
vertical distance, $I_{t\,min}$, between the point of tangency and the load line,
as shown in Figs. 4 or 5. The variation of $I_{t\,min}$ with $n$ will proceed in the
same manner as for the distance between the stable and unstable operating
points. This effect will be described more fully in the following.
When more rapidly varying trigger signals are applied to the blocking 
oscillator, the terms containing derivatives of g(7) and x become so 
large that they must be taken into account. Since (29) is both nonlinear 
and inhomogeneous, no analytical solution exists, and it is therefore 
impossible to derive in closed form conditions that the trigger signal 
must satisfy for reliable operation. Hence, numerical methods or analog 


computer solutions must be resorted to. In the particular case, where the 


magnitude required to trigger the circuit for a particular fixed width of 
the trigger pulse must be found, the latter method was chosen, because 
repeated adjustments of parameters until the desired solution is obtained 
are much more readily accomplished on an analog computer. 


* The quantitative definition of "small" is, of course, dependent on the
coefficients (circuit parameters) multiplying the various derivatives.
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To prevent overload of amplifiers and integrators and also to simplify 
the analog computer setup, (29) was first rewritten in terms of the
normalized output voltage, $y$, which simplifies it to:

$$a\frac{d^3y}{d\tau^3} + (a + b)\frac{d^2y}{d\tau^2} + (1 + b)\frac{dy}{d\tau} + y = f[y + g(\tau)]. \qquad (35)$$

The Gaussian pulse was used as a representative trigger signal; that is,

$$g(\tau) = G\,e^{-[(\tau - \tau_0)/0.6\tau_w]^2}, \qquad (36)$$

where $G$ is the magnitude, $\tau_w$ the 50 per cent width and $\tau_0$ the time at
which $g(\tau)$ attains its maximum value, $G$. For $f(x) = f[y + g(\tau)]$,
representing the normalized relationship between collector current and
base current, the analytical function

$$f(x) = \frac{x^2}{2x^2 - 2x + 1} \qquad (37)$$

was chosen. This choice was dictated strictly on the basis of mathematical
simplicity and tractability of the solution of (35) when $g$, $a$ and
$b$ are zero. This function is depicted by the solid line in Fig. 6, from
which it can be seen that it has a minimum at $x = 0$ and a maximum at
$x = 1$, where it is equal to zero and unity, respectively. Also, $f(x)$ has
an inflection point at $x = 1/2$ around which it is symmetrical. The slope of
$f(x)$ at this point is equal to 2. For values of $x$ less than zero and larger
than one it is slowly asymptotic to the horizontal line 1/2. The normalized
load line of the circuit is also shown in Fig. 6. It intersects $f(x)$ at the
points $x$ equal to 0, 1/2 and 1, and these are therefore the stable, unstable
and quasistable operating points, respectively. Although the function
$f(x)$ differs from a real $I_c/i_b$ characteristic in that it is not constant beyond
the stable and the quasistable operating points, it resembles such a
characteristic sufficiently closely within the region of interest. Hence,
solutions of (35) having reasonable overshoots should be expected to
approximate exact ones very closely and exhibit the main features of the
circuit accurately.

Fig. 7 Relationships between magnitude and width of applied signal
required to trigger the blocking oscillator with $b = 0$.
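
The operating points and the slow-trigger threshold can be computed directly from the normalized characteristic. The sketch below assumes the analytic form $f(x) = x^2/(2x^2 - 2x + 1)$, which has the properties listed above (zero at $x = 0$, unity at $x = 1$, slope 2 at $x = 1/2$, asymptote 1/2), and takes the threshold as the gap between load line and characteristic at the point of tangency. The helper names are hypothetical.

```python
def f(x):
    """Assumed normalized collector-current characteristic."""
    return x * x / (2.0 * x * x - 2.0 * x + 1.0)

def df(x):
    """Derivative of f, obtained by the quotient rule."""
    return 2.0 * x * (1.0 - x) / (2.0 * x * x - 2.0 * x + 1.0) ** 2

def bisect(func, lo, hi, iterations=100):
    """Root of func on [lo, hi], assuming func(lo) < 0 < func(hi)."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if func(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Intersections of f(x) with the load line y = x, classified by loop gain df/dx.
for x in (0.0, 0.5, 1.0):
    kind = "stable" if df(x) < 1.0 else "unstable"
    print(f"x = {x:.1f}: f(x) = {f(x):.3f}, slope = {df(x):.3f} ({kind})")

# Point of tangency with a displaced load line: slope of f equals unity.
x_j = bisect(lambda x: df(x) - 1.0, 0.0, 0.5)
g_min = x_j - f(x_j)
print(f"x_J = {x_j:.4f}, G_min = {g_min:.4f}")
```

With this characteristic the tangency falls near $x_J \approx 0.257$ and the threshold comes out at roughly 0.150.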

On the analog computer the following procedure was used. The circuit
was assumed to be at rest at the stable operating point prior to the
application of the trigger pulse, and the initial value of all the derivatives
was set equal to zero. Then, for each value of the width $\tau_w$, the magnitude
$G$ was adjusted so that the response barely flipped over to the
quasistable operating point. The resulting relationship between the
magnitude and width of the trigger pulse is shown in Figs. 7 and 8 for
various values of the parameters $a$ and $b$. It is seen that the curves are
somewhat hyperbolic in shape and that all of them are asymptotic to the
vertical line $G_{min} = 0.15$ at large pulse widths. This asymptote
corresponds exactly to the minimum trigger magnitude given by (33) and,
indeed, if the expression for $f(x)$ from (37) is substituted into (33) and
(34), the result is $G_{min} = 0.1504$, which agrees quite closely with the
computed value. From Figs. 7 and 8 it can be seen that, at large pulse
widths, the critical trigger magnitude is nearly independent of $a$ and $b$.
As the width becomes smaller, a larger magnitude is required to trigger
the circuit and this magnitude increases with $a$ and $b$. The curves
indicate a stronger dependence of $G$ upon $b$ than upon $a$, meaning that the
output capacitance and the load resistance affect the trigger sensitivity
of the circuit more than the leakage inductance does.

Fig. 8 Relationships between magnitude and width of applied signal
required to trigger the blocking oscillator with $b$ = 1, 2 and 4.
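
The magnitude-adjustment procedure can be imitated numerically. The sketch below is not the analog computer setup: it treats only the simplest case $a = b = 0$, for which the normalized equation is assumed to reduce to $dy/d\tau + y = f[y + g(\tau)]$, uses a Gaussian trigger of the assumed form $G\exp\{-[(\tau - \tau_0)/0.6\tau_w]^2\}$, integrates with a fixed-step Runge-Kutta rule, and finds the critical magnitude by bisection. All function names, step sizes and search limits are illustrative choices.

```python
import math

def f(x):
    """Assumed normalized collector-current characteristic."""
    return x * x / (2.0 * x * x - 2.0 * x + 1.0)

def gaussian_pulse(tau, G, tau_w, tau_0):
    """Assumed Gaussian trigger with 50 per cent width tau_w, peak G at tau_0."""
    return G * math.exp(-((tau - tau_0) / (0.6 * tau_w)) ** 2)

def triggers(G, tau_w, dt=0.05, settle=30.0):
    """Integrate dy/dtau = f(y + g) - y from rest; True if y ends above 1/2."""
    tau_0 = 3.0 * tau_w               # pulse is negligible at tau = 0
    t_end = 2.0 * tau_0 + settle      # pulse has died out well before t_end

    def rhs(t, y):
        return f(y + gaussian_pulse(t, G, tau_w, tau_0)) - y

    y, t = 0.0, 0.0
    while t < t_end:
        k1 = rhs(t, y)
        k2 = rhs(t + 0.5 * dt, y + 0.5 * dt * k1)
        k3 = rhs(t + 0.5 * dt, y + 0.5 * dt * k2)
        k4 = rhs(t + dt, y + dt * k3)
        y += dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        t += dt
    return y > 0.5                    # beyond the unstable point means it flipped

def critical_magnitude(tau_w, lo=0.0, hi=1.0, tol=1e-3):
    """Smallest G (within tol) that flips the circuit; assumes hi triggers."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if triggers(mid, tau_w):
            hi = mid
        else:
            lo = mid
    return hi

for tau_w in (5.0, 40.0):
    print(f"tau_w = {tau_w:5.1f}: critical G = {critical_magnitude(tau_w):.3f}")
```

The search interval assumes that a unit-magnitude pulse is enough to trigger, which holds for the moderate widths used here but not for arbitrarily narrow pulses. As the width grows, the critical magnitude falls toward the slow-trigger limit near 0.15.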

The fact that the curves in Figs. 7 and 8 resemble hyperbolas suggests
that the product of the pulse width and the difference between the magnitude
and its minimum value is nearly constant; that is,

$$(G - G_{min})\,\tau_w = N(a, b), \qquad (38)$$

where $N$ is a function of $a$, $b$ and the nonlinear characteristic. To investigate
this further, the above product was formed for all the measurements
taken, and the results are shown in Table II. It is seen that the value
that $N$ assumes for each measurement varies randomly around its mean
value with no consistent deviation in any direction, except at large pulse
widths, where it is quite small, and at short pulse widths, where it is
quite large. Taking into account the subjectiveness involved in determining
the point at which the response barely flips over to the quasistable
operating point, the accuracy of the computer and the fact that at
large pulse widths the value of $G$ is very nearly equal to $G_{min}$, it must
be concluded that (38) is indeed a good approximation over particular
regions of $\tau_w$. Since the area of the Gaussian pulse in (36) is

$$A = \int_{-\infty}^{\infty} g(\tau)\,d\tau = 0.6\sqrt{\pi}\,G\tau_w, \qquad (39)$$

it can be concluded that the area of an equivalent trigger pulse of magnitude
$(G - G_{min})$, not the area of the actual trigger pulse, stays approximately
constant for given values of $a$ and $b$. This brings out the
fact that the blocking oscillator is not strictly an amplitude threshold
device, but an energy threshold device.
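
The proportionality between pulse area and the magnitude-width product can be checked numerically. The sketch below assumes the Gaussian trigger $G\exp\{-[(\tau - \tau_0)/0.6\tau_w]^2\}$, whose area is then $0.6\sqrt{\pi}\,G\tau_w$; the magnitude and width used are hypothetical and the function names are illustrative.

```python
import math

def gaussian_pulse(tau, G, tau_w, tau_0=0.0):
    """Assumed Gaussian trigger with 50 per cent width tau_w."""
    return G * math.exp(-((tau - tau_0) / (0.6 * tau_w)) ** 2)

def pulse_area(G, tau_w):
    """Closed-form area under the assumed pulse."""
    return 0.6 * math.sqrt(math.pi) * G * tau_w

def numeric_area(G, tau_w, n=20000):
    """Trapezoidal estimate of the area over +/- 6 widths about the peak."""
    span = 6.0 * tau_w
    h = 2.0 * span / n
    total = 0.0
    for i in range(n + 1):
        weight = 0.5 if i in (0, n) else 1.0
        total += weight * gaussian_pulse(-span + i * h, G, tau_w)
    return total * h

G, tau_w = 0.25, 8.0  # hypothetical magnitude and width
print(f"closed form : {pulse_area(G, tau_w):.6f}")
print(f"numerical   : {numeric_area(G, tau_w):.6f}")
print(f"g at half-width / G = {gaussian_pulse(0.5 * tau_w, G, tau_w) / G:.4f}")
```

The last line confirms that the constant 0.6 places the half-amplitude points very nearly one width apart, so the area is a fixed multiple (about 1.06) of the product of magnitude and 50 per cent width.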

The data given in Table II can be closely approximated by a function
of the form

$$N(a, b) = K_0 + K_1a + K_2b. \qquad (40)$$

For $1.67 \le \tau_w \le 41.75$, and $b \ne 0$, these values are: $K_0 = 0.38$, $K_1 = 0.06$,
$K_2 = 0.42$. With these values in (40), the computed $N(a, b)$ agrees
with the tabulation in Table II to within better than 30 per cent. In
addition to the inaccuracies introduced by the computer and operator
when $G$ is close to $G_{min}$, it is to be expected that, at the largest pulse
widths, $N(a, b)$ will be independent of both $a$ and $b$. For $\tau_w = 167$, Table
II confirms the contention that the very slow trigger signal acts strictly to
vary the bias. When $\tau_w$ is between 16.70 and 83.5, $N(a, b)$ is independent
of $a$ for the range of parameters covered. Only the coefficients of the
second and third derivatives with respect to the base current are functions
of $a$. Therefore it can be concluded that this range of $\tau_w$ covers a
transition region where only the first derivative term ($\omega_0$ and collector
capacity) is important. At the other extreme, the very narrow trigger
pulse, the dependence on $a$ is larger than indicated by the value of $K_1$
derived for the region $1.67 \le \tau_w \le 41.75$. Here the dependence on the
higher derivatives (leakage inductance) is more pronounced, again
confirming our physical reasoning.

The case $b = 0$ is of little practical concern, since this corresponds to
an undamped output circuit working into a short-circuited load.

Other implications of the relationship between trigger magnitude and
width can be brought out more clearly if we examine the unnormalized
version of (40). If we substitute (40) in (38), neglect the weak
dependence on leakage inductance, neglect parasitic capacity from
collector to ground, and substitute from the normalizing equations, we get

$$(I_t - I_{t\,min})\,T_w = \frac{I_{cs}}{n}\,(1 + n)\left(\frac{K_0}{\omega_\alpha} + K_2\,1.5\,C_cR_c\right). \qquad (41)$$

From this relationship it can be seen that the magnitude-width product
decreases as $\omega_\alpha$ is increased until $K_0/\omega_\alpha$ can be neglected with respect to
$K_2\,1.5\,C_cR_c$. This dependence on $\omega_\alpha$ can also be viewed in an equivalent
manner. Since $K_0$ and $K_2$ are both about equal to 0.4, (41) can be written
in the following form:

$$(I_t - I_{t\,min})\,T_w \approx K_0\,\frac{I_{cs}}{n}\,\frac{1 + b}{\omega_0}. \qquad (42)$$

It will be recalled, from the discussions on the equivalent circuit for
the transistor, specifically (18), that $(1 + b)/\omega_0$ is a good approximation
to the large-signal (saturating step) delay time of the resistively
loaded common emitter stage. Therefore (42) can be written

$$(I_t - I_{t\,min})\,T_w = K_0\,\frac{I_{cs}}{n}\,T_D. \qquad (45)$$

As $\omega_\alpha$ is increased, the delay time of the basic transistor stage ($T_D$) without
external feedback decreases. In the blocking oscillator this is equivalent
to a reduction of one of the time constants in the positive feedback
loop, thereby permitting a more rapid build-up of the regenerative loop,
or, alternately, regeneration can occur for a lower trigger signal for a
given trigger width.
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The dependence of this product on n is more difficult to visualize, 
since $K_0$, $K_1$ and $K_2$ are functions of the circuit operating points, which
in turn are dependent on $n$. As mentioned previously, $I_{t\,min}$ will vary
with $n$ in much the same way that the distance between the stable and
unstable operating point varies with $n$. For small $n$, $I_{t\,min}$ will be large
and decrease to a minimum as $n$ increases, and then begin to increase
with $n$. This can be seen from the graphical construction of Figs. 4 and 5,
and it has also been observed experimentally. It is reasonable to expect
that the product given by (42) will vary in a similar manner, though no
rigorous proof can be given here.

For trigger signals that are nonzero over only a finite time interval,
a relationship between magnitude and width similar to that shown in
Figs. 7 and 8 exists. The principal difference is that the curves
corresponding to those of Fig. 7 or 8 are now asymptotic to a finite value
of width as well as of magnitude. In other words, for such trigger signals
there exists a minimum width, $\tau_{w\,\min}$, below which the blocking oscillator
cannot be triggered even when the magnitude of the trigger pulse is
made infinite. That such a minimum width exists can be easily realized
by considering (35) when a square pulse of infinite magnitude is applied
to the circuit. In such a case, the collector current generator will saturate
immediately, making $f[y + g(\tau)] = 1$. Hence, the actual forcing function
is finite even if the magnitude of $g(\tau)$ is infinite, and the applied trigger
pulse must therefore have a finite width in order to make the circuit flip
over to its quasistable operating point. The reason $\tau_{w\,\min}$ is not finite in
the case of the Gaussian pulse is the fact that this function is nonzero
except at infinity. Thus, the width of $g(\tau)$ must be zero in order to make
the duration of the actual forcing function finite.

These considerations may be used to calculate $\tau_{w\,\min}$. As an example,
let us consider the case when $a = b = 0$. The solution to (35) for
$\tau \le \tau_{w\,\min}$ is then simply

$$ y(\tau) = 1 - e^{-\tau} \tag{43} $$

and the minimum width is now the time sufficient to bring y up to the
unstable operating point; that is, $y = 0.5$. This gives $\tau_{w\,\min} = 0.693$. In
the more general case, when a and b are nonzero, the procedure is somewhat
more complicated. It involves finding first the relationship between
y and its first and second derivatives necessary to slide the circuit past
its unstable operating point when $g(\tau)$ is zero, and then determining the
width that will make these quantities satisfy this relationship. Since this
procedure is usually quite laborious, it is more practical to estimate
$\tau_{w\,\min}$ from two points on the magnitude-width curves.
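The figure of 0.693 quoted above can be checked by hand. The step below is a sketch under two assumptions read from the surrounding text: that with $a = b = 0$ and a saturated collector generator (35) reduces to a first-order equation with unit forcing, and that the unstable operating point lies at $y = 0.5$.

$$ \frac{dy}{d\tau} + y = 1, \quad y(0) = 0 \;\Rightarrow\; y(\tau) = 1 - e^{-\tau} $$

$$ 1 - e^{-\tau_{w\,\min}} = \tfrac{1}{2} \;\Rightarrow\; \tau_{w\,\min} = \ln 2 \approx 0.693 $$

For an unstable point at some other level $y_u < 1$ the same step would give $\tau_{w\,\min} = -\ln(1 - y_u)$.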
VI. DELAY AND RISE TIME


The delay and rise time of the blocking oscillator are, in general,
dependent upon all parameters in the circuit. The most important of
these are the magnitude and width of the trigger signal, the shape of the
$i_c/i_b$ characteristic, the cutoff frequency $\omega_0$ (or $\omega_\alpha$ and n) and, finally,
the collector capacitance in conjunction with the load resistance and the
leakage inductance. To see clearly how each particular parameter affects
the delay and rise time, these will be discussed separately, although it
may not be possible to vary some of them independently in the actual
physical circuit. In the following, the delay time is defined as the time
difference between the maximum of the trigger pulse and the 50 per cent
point of the normalized output. The rise time is the time required for the
output to traverse 10 to 90 per cent of its final value.
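The two definitions above can be applied to any sampled, normalized waveform. The following is a minimal sketch, not the procedure used by the authors: the function names, the linear interpolation between samples, and the single-pole test waveform are all assumptions made for illustration.

```python
import math

def crossing_time(t, y, level):
    """First time the sampled waveform y rises through `level` (linear interpolation)."""
    for i in range(1, len(y)):
        if y[i - 1] < level <= y[i]:
            frac = (level - y[i - 1]) / (y[i] - y[i - 1])
            return t[i - 1] + frac * (t[i] - t[i - 1])
    raise ValueError("level never reached")

def delay_and_rise(t, y, t_trigger_peak=0.0):
    """Delay: trigger maximum to the 50 per cent point. Rise: 10 to 90 per cent."""
    delay = crossing_time(t, y, 0.5) - t_trigger_peak
    rise = crossing_time(t, y, 0.9) - crossing_time(t, y, 0.1)
    return delay, rise

# Illustrative waveform (an assumption): a single-pole response 1 - exp(-t).
dt = 0.001
t = [k * dt for k in range(10001)]
y = [1.0 - math.exp(-tk) for tk in t]
delay, rise = delay_and_rise(t, y)
print(delay, rise)
```

For this single-pole waveform the rise time is ln 9, about 2.2 normalized time units, which is where the factor 2.2 in rise-time formulas of this kind comes from.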


6.1 The Effect of Trigger Magnitude and Width 


It is instructive to consider first the transient responses when the
blocking oscillator is simply released from various initial positions. In
such cases these responses are obtained from (35) by setting $g(\tau) = 0$
and assigning initial values to y and its derivatives. In particular, if the
collector capacitance and the leakage inductance are so small that they
may be neglected, an analytical expression for these responses may be
obtained. If we limit ourselves to this particular case by setting
$a = b = 0$ in (35) and using the analytical approximation to the $i_c/i_b$
characteristic, the solution to this equation becomes (by simple separation
of variables)


( ) (O44. \ig(G--) — 
yy | y(O+ )ly(0-4 : i, (44) 


y— UO y(O+) — 0.9 


where $y(0+)$ represents the initial value of y. The bistable nature of the
blocking oscillator is reflected in (44), because, as $\tau \to \infty$, y approaches
either the stable ($y = 0$) or the quasistable ($y = 1$) operating point.
Which one it will attain in an actual case depends upon whether the
initial position is below or above the unstable operating point. In the
neighborhood of the point ($y = 0.5$) the output voltage becomes an
exponential ascending function of time, thereby exhibiting the unstable
characteristic of this point. Fig. 9 shows the responses of (44) for six
initial values located between the unstable and the quasistable operating
points. It is seen that, as the initial value of y is decreased toward the
unstable operating point, the delay and rise time of the circuit become
larger, increasing rapidly as $y(0+)$ approaches this point. To minimize
rise time, the magnitude and width of the trigger signal should at least
be such that the circuit has been brought well beyond the unstable point
when the trigger terminates.

Fig. 9 Normalized time response of the blocking oscillator when the circuit
is released from various initial positions
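The qualitative behavior described above (two attracting points, with the transition slowing down as the release point nears the unstable one) can be reproduced with a toy model. The sketch below is only illustrative: the cubic `net_drive` is a hypothetical stand-in for $f(y) - y$ with equilibria at 0, 0.5 and 1, not the analytical approximation (37) used in the paper, and the gain `k` is arbitrary.

```python
def net_drive(y, k=4.0):
    # Hypothetical stand-in for f(y) - y: zero at y = 0 (stable),
    # y = 0.5 (unstable) and y = 1 (quasistable).
    return k * y * (y - 0.5) * (1.0 - y)

def release(y0, t_end=40.0, dt=0.01):
    """Integrate dy/dtau = net_drive(y) from y(0+) = y0 with classical RK4."""
    ts, ys = [0.0], [y0]
    y = y0
    for i in range(int(round(t_end / dt))):
        k1 = net_drive(y)
        k2 = net_drive(y + 0.5 * dt * k1)
        k3 = net_drive(y + 0.5 * dt * k2)
        k4 = net_drive(y + dt * k3)
        y += dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        ts.append((i + 1) * dt)
        ys.append(y)
    return ts, ys

def time_to_reach(level, y0):
    """Time at which the released response first reaches `level`, or None."""
    ts, ys = release(y0)
    for t, y in zip(ts, ys):
        if y >= level:
            return t
    return None

for y0 in (0.51, 0.6, 0.7, 0.8):
    print(y0, time_to_reach(0.9, y0))
```

Releasing the model from just above 0.5 takes several times longer to reach the 90 per cent level than releasing it from 0.7, which is the trend the text attributes to Fig. 9; the absolute times depend entirely on the assumed cubic.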

When the coefficients a and b (or the trigger function) are not equal
to zero, (44) cannot be solved analytically and numerical methods must
be used. In Fig. 10 the results of solving this equation by the
Runge-Kutta method on a digital computer are shown. Again, the analytical
approximation to the $i_c/i_b$ characteristic was used and the Gaussian
function represented the trigger signal. To conform with values encountered
in a particular circuit, a and b were, in all of these cases, made equal
to 0.447 and 0.8, respectively. In the first of these figures, Fig. 10(a),
the normalized response of the base current, x, and the output voltage, y,
to a slow trigger pulse, are shown. It is seen that the output voltage does
not change appreciably until $g(\tau)$ reaches the critical triggering level,
which, from Table II, is equal to 0.167 for $\tau_w = 41.75$. Subsequently, the
output voltage switches rapidly in comparison to the trigger signal to
the quasistable operating point. Hence, one should expect, in the case of
a slow trigger signal, that the rise time of the blocking oscillator is more
or less independent of the trigger magnitude as long as this magnitude
is larger than the critical value necessary to activate the circuit. The
upper curve in Fig. 11 bears out this statement. Here, the rise time is
shown as a function of pulse magnitude, and it is seen that, except for a
trigger magnitude close to the critical triggering level, the rise time is
virtually constant over the whole range. Thus, one may conclude that
in this case the rise time is, practically speaking, dependent only upon
the parameters of the blocking oscillator itself. The delay time, on the
other hand, will of course vary with trigger magnitude, but only to the
extent of the time it takes the trigger signal to reach the critical triggering
level.

Fig. 10 Computed responses of the blocking oscillator to trigger pulses of
various widths and magnitudes: (a) slow trigger pulse; (b) inadequate trigger
pulse; (c) trigger pulse having just limiting magnitude; (d) adequate trigger pulse

Fig. 11 Rise time as a function of trigger pulse magnitude

Figs. 10(b) through 10(d) show the response of the blocking oscillator
to a short trigger pulse of successively increasing magnitude. Actually,
in all figures except Fig. 10(c) two trigger pulses were applied, one that
turns the blocking oscillator "on", followed by another that turns it
"off". This was done in order to simulate the conditions for a blocking
oscillator that might be used in a regenerative repeater. In Fig. 10(b) the
magnitude of the trigger signal was insufficient to activate the blocking
oscillator, while in Fig. 10(c) it was just barely large enough to make the
circuit flip over to the quasistable operating point. The rather large rise
time and the inflection point at $y = 0.5$ should be noted in the latter
case, which, as explained previously, is due to the fact that the circuit is
now essentially released from a position close to the unstable operating
point. As the magnitude is increased further the rise time at first decreases
rapidly and then levels off, approaching a limit for large values of G.
This is depicted by the lower curve in Fig. 11. In this case the rise time
is largely independent of trigger magnitude as long as the magnitude is
at least 50 per cent larger than the critical triggering level. It should be
pointed out, however, that the apparent leveling off of the rise time in
the fast pulse case is only true when $\omega_0$ may be considered to be
independent of trigger magnitude. If a fast and large trigger pulse is applied
such that the base current significantly exceeds the value /,, during
transition, the maximum value of this current should, according to (7),
be used to calculate $\omega_0$ rather than /,, . This results in a larger $\omega_0$ and
therefore a smaller rise time than that indicated by Fig. 11. The correction
to be applied to $\omega_0$ in such cases is, by virtue of (6) and (8), equal to

In most practical blocking oscillators, however, the stable and unstable
operating points are located very close together, so that such correction
is rarely needed. Also, the desirability of a large power gain favors the
use of a design requiring only small trigger magnitudes. Finally, in the
case of a slow trigger signal, such correction is never needed, since here
the trigger signal does not change appreciably during the relatively fast
transition from the stable to the quasistable operating point.


6.2 The Effect of the Shape of the $i_c/i_b$ Characteristic


To see how the shape of the $i_c/i_b$ characteristic affected the delay and
the rise time of the blocking oscillator, transient responses were computed
from (35) using for f(x) the functions depicted in Fig. 6. In all cases, the
location of the stable operating point was kept fixed and the circuit was
assumed to be initially at rest at this point. To trigger the circuit a
Gaussian pulse was used which had a 50 per cent width of 4.175 and a
magnitude in each case equal to the distance between the stable and
unstable operating points. In all cases, a = 0.447 and b = 0.8.

In the first set of computations the location of the unstable operating
point was kept fixed while the slope of the characteristic through this
point was varied by using four different functions for f(x). These are the
cases numbered 1 through 4 in Fig. 6, the fourth one being essentially
Case 3 multiplied by ten. The slopes at the unstable operating point in
each of these cases, which on an unnormalized scale represent $\beta_{\max}/n$
of the circuit, are, respectively, 1.14, 2, 8 and 80. However, the average
value of $\beta/n$ is the same in all cases, since the quasistable operating point
is located at (1, 1) in the first three cases and at (10, 10) in the last one.
Figs. 12(a) through 12(d) give the results of computing the transient
response from (44) in each of these cases. In Fig. 13 the rise time is
plotted as a function of $\beta_{\max}/n$. It is seen that the delay and rise time
change only slightly over the whole range of $\beta_{\max}/n$, the most rapid
variation being confined to the region where $\beta_{\max}/n$ is nearly equal to
unity. The slope must be greater than this value for f(x) to intersect the
load line at three points. Hence, it can be concluded that the rise time
decreases very little when $\beta_{\max}/n$ is increased, as long as this quantity is
larger than unity. In fact, Fig. 13 shows that a 70-to-1 change in loop
gain results in only a 30 per cent change in rise time, and that the bulk
of this change occurs for the first 7-to-1 increase in loop gain. The reason
for this somewhat surprising result is, of course, the fact that the average
of $\beta/n$, and therefore the average of the loop gain, is the same in all
these cases.

Fig. 13 Rise time as a function of $\beta_{\max}/n$ (G = 0.5, $\tau_w$ = 4.175,
a = 0.447, b = 0.8)

Finally, it should also be noted in Figs. 12(a) through 12(d) that,
while the rise time decreases as the slope of f(x) becomes steeper, the
delay increases. This is so because the collector current starts to flow
much earlier in the case of a gentle slope than in the case where f(x) is
steep.

To investigate the effect of moving the unstable operating point, the
transient response was also computed for Case 5 and Case 6 in Fig. 6.
The unstable operating points in these two cases are, respectively, located
at x = 0.1 and x = 0.9 and both functions have slopes identical to that
of Case 3. Comparing the resulting responses of Figs. 12(e) and 12(f)
with that of Fig. 12(c), it is seen that, as far as the delay and rise time
are concerned, the location of the unstable operating point has even less
effect than the slope has. A slightly smaller rise time is obtained when
the unstable operating point is located farther away from the stable one,
since the trigger is larger in such cases.

Before concluding this section, it should again be pointed out that the
dependence of $\omega_0$ on the average value of $\beta$ must be taken into account
when the above results are applied to actual circuits. The fact that
$1/\omega_0$ and $C_c$ are nearly proportional to $\beta$ [(7) and (15)] tends to override
the somewhat weak dependencies discussed above. Transistors should
therefore be selected for large $\omega_0$ and small $C_c$ rather than a high $\beta$ to
obtain the best rise time. On the other hand, a high low-current $\beta$ is
desirable to increase the trigger sensitivity of the circuit. Finally, the
above results for a single pair of values of a and b and a single value of
$\tau_w$ apply to other values of these parameters. The basic shapes of the
curves in Figs. 11 and 13 remain practically unaltered. The principal
effect of other values of these constants is to displace these curves both
horizontally and vertically, with the vertical displacement the most
pronounced effect.


6.3 The Effect of Leakage Inductance and Collector Capacitance upon the 
Delay and Rise Time 


To investigate how the leakage inductance and collector capacitance
affect the delay and rise time, the transient response of the blocking
oscillator was computed from (35) for a large number of values of the
constants a and b. Four different types of trigger signals were used
throughout this investigation. In the first case, the effect of a very slow
trigger signal was simulated by employing a Gaussian pulse for $g(\tau)$ of
magnitude and width equal to G = 0.3 and $\tau_w$ = 166.7, respectively.
As pointed out in a previous section, this represents the case in which the
rise time is solely dependent upon the properties of the feedback loop of
the blocking oscillator and not upon the detailed behavior of the trigger
signal itself. In the next two cases, Gaussian pulses of medium ($\tau_w$ = 25)
and extremely small ($\tau_w$ = 1.175) widths were used, illustrating situations
where both the trigger signal and the circuit properties affect the
rise time. In all three cases, however, the magnitude of the trigger signals
was selected such that the circuit operated on the flat portions of the
curves in Fig. 11. In the fourth and last case, a step function of magnitude
sufficient to saturate the transistor immediately (G = 1) was
applied. Since the entire driving capability of the transistor is now
employed from the very beginning, this represents the situation in which
the delay and rise time are minimized. Also, the loop gain is zero during
the entire transition, so that the degenerative effect of positive feedback
on rise time is nonexistent in this case. Finally, in all cases, the blocking
oscillator was assumed to be initially at rest at its stable operating point,
and the analytical function in (37) was used to represent the $i_c/i_b$
characteristic.

Figs. 14(a) through 14(d) give the delay time as a function of the
constants, a and b, in the four cases. It is seen that in nearly all cases the
delay is a linear function of b when either this constant or a is considerably
larger than one. The only noticeable deviation from this rule is the
a = 8 curve in Fig. 14(b), which starts to bend upwards for large b. The
reason for this is that here the critical triggering level approaches the
actual trigger magnitude as b is increased; in other words, the circuit is
no longer operating on the flat portions of the curves in Fig. 11. From
Fig. 14(a) it should be noted that, for values of b less than 4, the delay
goes through a minimum when a is varied.

Fig. 14 The delay as a function of a and b when the trigger signal is (a) a
very slow Gaussian pulse; (b) a moderately fast Gaussian pulse; (c) a short
Gaussian pulse and (d) a step function

Figs. 15(a) through 15(g) show the rise time as a function of a and b
for the four trigger functions. Comparing these figures, it is seen that the
rise time is largest in the case of a very slow trigger signal and smallest
in the case of the saturating step function. However, the variation in
rise time with b is much less in the former case than in the latter one,
while with a it is somewhat the other way around. Also, the dependence
is stronger upon b than upon a. Finally, in all cases, the rise time goes
through a minimum when a is varied and b kept fixed, this minimum
being most pronounced in the case of a moderately fast trigger pulse
[Fig. 15(d)].

Since it is usually not possible to obtain an analytical solution to 
(35), an expression for the rise time that is valid under all conditions 
cannot be found either. However, in two of the cases discussed above, 
approximate expressions for the rise time were found to fit the computed 
data over a limited range. The first of these pertains to the case of a 
very slow trigger signal and was obtained by a trial-and-error procedure. 
It was discovered that the expression 


ij 


Tr 7.85Y1+ 2b4+ a (46) 


approximates the computed data in Fig. 15(a) within ±10 per cent.
The second expression for the rise time is concerned with the case where
a unit step function is applied to the circuit. Since the transistor is
saturated immediately in this case, (35) is linear with $f[y + g(\tau)] = 1$,
and the transistor can be characterized by the transfer function

$$ \frac{Y(s)}{U(s)} = \frac{1}{(s + 1)(a^2 s^2 + b s + 1)}, \tag{47} $$

where $U(s)$ is the Laplace transform of the unit step function. The
poles of the transfer function are given by $s = -1, -\alpha, -\beta$, where

$$ \alpha = \frac{b + \sqrt{b^2 - 4a^2}}{2a^2} \quad \text{and} \quad \beta = \frac{b - \sqrt{b^2 - 4a^2}}{2a^2}. \tag{48} $$


It can be shown that a system with a transfer function whose only
singularities in the finite part of the complex plane are negative real
poles will have monotonic step response. For the blocking oscillator,
$\alpha$ and $\beta$ are real when $b \ge 2a$ and a monotonic step response results.
Under these conditions, Elmore's definition of delay and rise time gives
results in good agreement with those obtained by the exact calculations
used to obtain the results of Fig. 15.

Fig. 15 Rise time of the blocking oscillator as a function of the constants a
and b when the trigger signal is: (a) and (b) a very slow Gaussian pulse; (c) and (d)
a moderately fast Gaussian pulse; (e) a very short Gaussian pulse; (f) and (g) a
step function of sufficient magnitude to saturate the transistor immediately.

Application of Elmore's definitions
yields the normalized delay

( 14 ) 
and the normalized rise time

$$ \tau_r = 2.2\sqrt{1 + b^2 - 2a^2} $$

or, in unnormalized form (neglecting parasitic capacity),


| 
. + PR.C% (1 + n)( + L5ntc.), 


As expected, n and $R_L C_c$ should be selected as small as possible to reduce
the rise time while $\omega_\alpha$ should be large. Equation (52) suggests that there
exists a value of the leakage inductance that will minimize the rise time.
The exact value of the leakage inductance that accomplishes this cannot
be determined from (52), since it is strictly valid only when the
response is monotonic.
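The Elmore-type rise-time estimate for the saturating step can be compared with a direct numerical solution. The sketch below assumes that, with the transistor saturated, (35) takes the linear third-order form implied by the transfer function (47), namely $a^2 y''' + (a^2 + b) y'' + (1 + b) y' + y = 1$ with the circuit starting at rest. The function names, the test values a = 1 and b = 3 (chosen so that $b \ge 2a$), and the hand-written Runge-Kutta integrator are illustrative choices, not the authors' program.

```python
import math

def elmore_delay_and_radicand(a, b):
    """Elmore moments for H(s) = 1 / (1 + b1*s + b2*s^2 + b3*s^3), from (47)."""
    b1 = 1.0 + b                      # coefficient of s
    b2 = a * a + b                    # coefficient of s^2
    delay = b1                        # Elmore delay: first moment of the impulse response
    radicand = b1 * b1 - 2.0 * b2     # algebraically equal to 1 + b^2 - 2a^2
    return delay, radicand

def step_response(a, b, t_end=60.0, dt=0.01):
    """Classical RK4 for a^2 y''' + (a^2+b) y'' + (1+b) y' + y = 1, starting at rest."""
    def deriv(state):
        y, v, w = state
        return (v, w, (1.0 - y - (1.0 + b) * v - (a * a + b) * w) / (a * a))

    def shifted(state, k, h):
        return tuple(s + h * ki for s, ki in zip(state, k))

    state = (0.0, 0.0, 0.0)
    times, values = [0.0], [0.0]
    for i in range(int(round(t_end / dt))):
        k1 = deriv(state)
        k2 = deriv(shifted(state, k1, 0.5 * dt))
        k3 = deriv(shifted(state, k2, 0.5 * dt))
        k4 = deriv(shifted(state, k3, dt))
        state = tuple(s + dt * (p + 2.0 * q + 2.0 * r + u) / 6.0
                      for s, p, q, r, u in zip(state, k1, k2, k3, k4))
        times.append((i + 1) * dt)
        values.append(state[0])
    return times, values

def first_crossing(times, values, level):
    """First upward crossing of `level`, linearly interpolated between samples."""
    for i in range(1, len(values)):
        if values[i - 1] < level <= values[i]:
            frac = (level - values[i - 1]) / (values[i] - values[i - 1])
            return times[i - 1] + frac * (times[i] - times[i - 1])
    raise ValueError("level not reached")

a, b = 1.0, 3.0   # illustrative values with b >= 2a (real poles)
delay, radicand = elmore_delay_and_radicand(a, b)
formula_rise = 2.2 * math.sqrt(radicand)
times, values = step_response(a, b)
simulated_rise = first_crossing(times, values, 0.9) - first_crossing(times, values, 0.1)
print(formula_rise, simulated_rise)
```

Elmore's original rise-time definition carries a factor of $\sqrt{2\pi}$; the factor 2.2 used here follows the normalized rise-time expression in the text, which matches the 10 to 90 per cent rise time of a single pole.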

Finally, in Figs. 16(a) through 16(c) the overshoot is shown as a
function of a, with b/2a as a parameter. It is seen that, as b/2a increases,
the overshoot decreases, becoming zero when b/2a = 1. It should be
noted that, for fixed values of b/2a, the overshoot approaches a constant
when a is large. Spot checks of these analog computer results were made
on a digital computer, with excellent agreement between the two results.


6.4 Conditions for Monotonic Response 


In the preceding section it has been shown that the blocking oscillator
will have monotonic or oscillatory response when it is driven by a
saturating step function, depending on whether b is greater than or less
than 2a, respectively. For example, the response will be monotonic if

Fig. 16 The percentage overshoot of the blocking oscillator when the trigger
signal is (a) a very slow Gaussian pulse; (b) a moderately fast Gaussian pulse and
(c) a step function

The condition $b = 2a$ yields two coincident real roots and divides the
a-b plane into regions of monotonic and oscillatory responses, as shown
in Fig. 17. Two coincident real roots can also result when either $-\alpha$ or
$-\beta$ is equal to $-1$. This leads to the parabola

$$ b = 1 + a^2 \tag{55} $$


in the a-b plane. At the point (1, 2) this parabola is tangent to the
straight line $b = 2a$ and the blocking oscillator has three coincident real
poles at $s = -1$.

There is one exception to the above division in the a-b plane.
When $b = 2a^2$ and $0 < a < 1$, a pair of complex conjugate poles whose
real part is $-1$ arises. The poles all lie on a line that is parallel to the
imaginary axis in the complex plane and spaced one unit to the left. In
this case, the impulse response is tangent to the zero line, so that the
step response is monotonic. It can be shown that in the region $b \ge 2a^2$,
$0 < a \le 1$ the response will be monotonic. The rise time in this case with
$b = 2a^2$ is given by

$$ \tau_r = 2.2\sqrt{1 + 4a^4 - 2a^2}. \tag{56} $$

The above rise time is a minimum when a = 0.5; the minimum rise time
then is 1.87. Actual transient calculations for this condition yield a
minimum rise time of 1.45 at a = 0.45. In this case, the 10 to 90 per cent
rise time is faster than it is in any of the other cases of monotonic
response. However, the response proceeds quite slowly from the 90 per
cent point to its final unit asymptote. This behavior explains the
discrepancy between the results of (56) and the exact transient calculations.

Fig. 17 Conditions for monotonic step response
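The division of the a-b plane described in this section can be expressed as a small classifier. This is a sketch under the assumption that the poles are those of the saturated transfer function (47), with $\alpha$ and $\beta$ as in (48); the function names are hypothetical, and the monotonic test simply encodes the two regions stated in the text ($b \ge 2a$, or $b \ge 2a^2$ with $0 < a \le 1$).

```python
import cmath

def poles(a, b):
    """Poles of 1 / ((s + 1)(a^2 s^2 + b s + 1)): s = -1, -alpha, -beta."""
    disc = cmath.sqrt(b * b - 4.0 * a * a)   # complex square root handles b < 2a
    alpha = (b + disc) / (2.0 * a * a)
    beta = (b - disc) / (2.0 * a * a)
    return [complex(-1.0, 0.0), -alpha, -beta]

def is_monotonic(a, b):
    """Step response is monotonic in the regions named in the text."""
    if b >= 2.0 * a:                           # all poles real and negative
        return True
    return 0.0 < a <= 1.0 and b >= 2.0 * a * a  # exception region near the origin

for a, b in [(1.0, 3.0), (1.0, 2.0), (1.0, 1.0), (0.5, 0.5)]:
    print(a, b, is_monotonic(a, b), poles(a, b))
```

At (a, b) = (1, 2) all three poles coincide at s = -1, and at (0.5, 0.5), which lies on $b = 2a^2$, the complex pair has real part -1, in line with the exception described above.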
Ideally, the blocking oscillator should have a response that has both
as small a rise time as possible and a flat, smooth top. The best physical
compromise to this requirement is to make the circuit nearly critically
damped during this interval. In other words, the coefficients a and b
should be selected such that they lie in a region close to the straight line,
$b = 2a$, in Fig. 17, i.e.,


Re. = 2 '/ C 


Furthermore, Figs. 15(f) and 15(g) show that a minimum rise time
can be achieved for small values of a and b. In fact, a rise time faster
than the minimum rise time of the transistor itself ($2.2/\omega_0$) can be
obtained. Some overshoot is associated with these cases. This result is
similar to that achieved with series peaking in a conventional interstage,
and confirms the conclusions of Linvill and Mattson. However, in this
case the blocking oscillator is acting as an overdriven amplifier with zero
loop gain. The larger the drive the faster the response, since $\omega_0$ increases
with drive. Smallest rise time can only occur when the power gain
required of the blocking oscillator is low. This requirement may be
satisfied in some computer applications where the primary function of the
blocking oscillator is to retime a relatively sharp pulse train. In
applications where the blocking oscillator must reshape as well as retime a pulse
train, such as in pulse code modulation, considerable power gain will be
demanded from the circuit and the minimum rise time cannot be
achieved. This is the analog of the familiar gain-bandwidth product for
linear systems.

When signals other than the saturating step function are used to
trigger the blocking oscillator, the manner in which the quasistable
operating point is approached is dependent on the parameters a and b,
the nonlinear characteristic and the trigger signal. When the transistor
saturates after suitable triggering, $f[y + g(\tau)] = 1$. At this point, as
noted previously, (35) is linear and the Laplace transform of the output
can be expressed as:

$$ Y(s) = \frac{1}{s\,[a^2 s^3 + (a^2 + b)s^2 + (1 + b)s + 1]} + \frac{[a^2 s^2 + (a^2 + b)s + (1 + b)]\,y(t_s) + [a^2 s + (a^2 + b)]\,y'(t_s) + a^2 y''(t_s)}{a^2 s^3 + (a^2 + b)s^2 + (1 + b)s + 1} $$

where $y(t_s)$, $y'(t_s)$ and $y''(t_s)$ are, respectively, the values that y, its
first and its second derivatives attain when the transistor becomes
saturated. The kind of response y(t) will have is therefore dependent
upon both the initial conditions at saturation and the roots of the
characteristic equation

$$ a^2 s^3 + (a^2 + b)s^2 + (1 + b)s + 1 = (s + 1)(a^2 s^2 + b s + 1) = 0. $$

Relationships between the initial conditions and a and b can be established
such that the quasistable operating point is approached
monotonically.¹³ This will not be pursued here, since the values of the derivatives
at saturation are not normally measured quantities. On purely physical
grounds, it would be expected that the conditions given previously for
monotonic response in the case of the saturating step function should
also apply here. This follows when it is realized that the saturating step
function brings the full capability of the transistor into play immediately,
thereby imparting maximum energy into the output circuit. For
other trigger signals this will not be true, so that the energy transferred
to the output circuit will be smaller than it will be in the case of the
saturating step. Since the values of the output function and its derivatives
when the transistor goes into saturation are a measure of this
energy, it would appear that, if no overshoot occurs with the saturating
step, none should occur for other trigger signals. This heuristic argument
is bolstered by the results displayed in Figs. 18(a) and 18(b). These
figures give the analog computer solutions to (35) for various values of
a and b when either a fast or slow trigger is applied to the circuit. In both
cases the response adheres to the conditions given on Fig. 17 for
monotonic response.


VII. THE “ON” INTERVAL 


After the blocking oscillator has reached the quasistable operating
point the magnetizing inductance becomes charged slowly, reducing the
current fed back to the base. This causes the load line, and therefore the
quasistable operating point, to move to the left, in Fig. 4 or 5, until the
load line is tangential to the $I_c(i_b)$ curve. Beyond this point (J′), the
load line intersects $I_c(i_b)$ only in the cutoff region, and the circuit now
has only one possible operating point. Hence, a rapid "jump" toward
cutoff takes place at J′, and it is therefore natural to define the "On"
interval as the time it takes the base current to decrease from the
quasistable operating point to the value it has at the point of tangency, J′,
that is:


FF a lp < I,

In Fig. 19 the results of solving the complete equation [(116) of
Appendix B] of the blocking oscillator on an analog computer are shown.
The analytical representation (37) of the $i_c/i_b$ characteristic was used
and V(x) was approximated by two straight-line segments. Also, the
constants a, b, d and e were kept fixed while the constant c, which is
proportional to the magnetizing inductance, was varied. It is seen that,
except when c is small and the circuit oscillates, the base current and the
base voltage vary rather slowly during the "On" interval. Hence, during
this interval terms involving derivatives of x and V(x) may be neglected.
This also follows from the fact that, in all cases of practical interest, the
magnetizing inductance is very large compared with the other storage
elements in the circuit. For the same reason, the amount of current
collected by this inductance during the "Transition On" interval is also
assumed to be negligible, i.e., $m(\tau_1) = 0$. Assuming the trigger signal to
have terminated by this time, the equation governing the normalized
base current during this interval therefore reduces to


$$ x - h(x) + \frac{1}{c}\int_0^{\tau} V(x)\,d\tau = 0 $$

or

$$ \frac{dx}{d\tau} = -\frac{1}{c}\,\frac{V(x)}{1 - \dfrac{dh(x)}{dx}}. $$

The unnormalized version of this equation becomes


° diy Vile) — Eo 
whic St > Some oe, (61) 
ok ee 


! d lp 


The above equation reflects the operation of the circuit during this
interval exactly as it was described at the beginning of this section. First,
it indicates that the base current decreases rather slowly, since $L_m$ is
large and $V_b(i_b) - E_{bb}$ small. Then, when $dI_c(i_b)/di_b$ becomes equal to
unity [which corresponds to the point of tangency (J′) in Fig. 4], it
predicts that a jump takes place, by virtue of the fact that $di_b/dt \to \infty$
at this point. Since


. r (is 
p! + dV, ty) 


1, — Ct) _ diy R, dI.(is) 
dt R,’ 4 dV (2) dt» 


dry 
it can be seen that the "midband" loop gain of the circuit is equal to
unity at the point where the jump takes place.

As a first step toward deriving a formula for the natural pulse width
of the blocking oscillator, let us introduce the above expression into (61)
and also take advantage of the fact that $V_b(i_b)$ is very nearly linear in
the saturation region, so that $V_b(i_b) = r_{bs} i_b$. Thus, (61) becomes:

. Ew 
Rs’ + rr dis wo 
Rs’ Tos dt l R,! dl (1%) 
a n R,’ Tt Tbe dt, 


nN Lim 





By substituting the analytical approximation for $I_c(i_b)$, the above
equation may be solved exactly. However, in cases of practical interest, the
quasistable operating point extends quite far into the saturation region
and, furthermore, most $i_c/i_b$ characteristics bend quite sharply before
going into saturation. Therefore the jump point, J′, is positioned very
close to the saturation level. Hence, the transistor is saturated during
almost the entire "On" interval with the result that $dI_c(i_b)/di_b$ is equal
to zero except in the near vicinity of the point J′. For the purpose of
deriving an expression for the pulse width, it is therefore an excellent
approximation to neglect the loop gain term in (63). Solving this equation
under these conditions, introducing the values of $i_b$ at the
beginning and end of the "On" interval and substituting for $R_s'$ from (24),
the expression for the natural pulse width may be written


esi re [1 +o (4 Rac») Baga ae 
Te ~ tlm - nLom Ne js 


— . In 
I; 





Rirp 


In most circuits 


R, > ne + Rs (: + RCs) and TI), 
n? Lim Wi) 


so the above equation reduces to 


z Ew 

| ane 

> Kw 
Teste 


(66) 


Figs. 20 and 21 show the normalized pulse width as a function of n for
various values of $I_{bj}/I_{cs}$ and $E_{bb}/I_{cs}r_{bs}$, as computed from (66). It is seen
that the pulse width goes through a maximum and is nonzero over only
a finite range of n, the limiting values of n being 0 and $I_{cs}/I_{bj}$.
Physically, the upper limit corresponds to the coincidence of the quasistable
and the unity loop gain points, making it impossible for the circuit to
remain in a conducting state for a finite time. It should be noted from
these figures that, for a fixed turns ratio, the natural pulse width exhibits
a strong dependence upon both $E_{bb}/I_{cs}r_{bs}$ and $I_{cs}/I_{bj}$, i.e., on both the
nonlinearities and the bias.

Fig. 20 — Pulse width as a function of turns ratio, with $E_{bb}/I_{cs}r_{bs} = -1$.

Before concluding this section, two things should be pointed out. First,
the quantity $I_{bj}$ in (66) is, strictly speaking, also a function of n,
because, as n is varied, the point of tangency in Fig. 4 changes. However,
since the $i_c$-$i_b$ characteristic usually bends quite sharply before going into
saturation, $I_{bj}$ will vary only slightly with n. Second, in the above
analysis the effect of minority carrier storage has been neglected. The presence
of such carriers in the collector junction introduces a delay before the
transistor can leave the saturation region, as if the $I(i_b)$ curve had been
temporarily moved somewhat to the left in Fig. 4, thereby causing the
pulse width to be slightly larger than that predicted by (60). However,
this effect may be partially accounted for by calculating the pulse width
from (64) rather than from (66), using, as explained in the section on the
equivalent circuit, the value of $\omega_0$ appropriate to turnoff.


VIII. THE “TRANSITION OFF” INTERVAL 


In discussing the “Transition Off” interval one must discriminate
between two cases: one in which the blocking oscillator turns off by
itself, and the other in which it is triggered off by an externally
applied signal. In the following, these two cases will be discussed
separately.


8.1 Internal “Turn Off” 


As explained in the previous section, when the magnetizing inductance
becomes sufficiently charged so that the load line is tangent to the $I(i_b)$
curve at the unity loop gain point, J’, a rapid transition toward the
other point of intersection between the load line and the characteristic
takes place. Since the input impedance to the transistor is very high in
the reverse direction, the bias current, $I_{bc}$, is very small and the $I(i_b)$
curve is almost vertical in the cutoff region. Thus, the end point of the
transition is essentially located on the $i_b = 0$ axis. Hence, it is natural in
this case to define the “Transition Off” interval as the time it takes the
base current to vary between $I_{bj}$ and $I_{bc} = 0$. The transistor is therefore
conducting over almost this entire interval and the same approximations
can be made here as for the “Transition On” interval. Accordingly,
the equations governing the responses in the two intervals will be alike,
the only differences being that the trigger signal is absent and the
magnetizing current is


Fig. 21 — Pulse width as a function of turns ratio.
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ti) «thm / [ + (; | eR ace wa 2 
Nn? Lim \wo Ry (67) 


T.(1;) — nto, 


or, in normalized form, 


‘ee 
m(7;) J\2j) - (1 + V(2x;) — 2; = f(zj) — aj. (68) 


c 


As explained in the section on the equivalent circuit, a different $\omega_0$ that
takes into account the effect of minority carrier storage must now be
used. This latter modification usually results in a fall time that is from
two to ten times the rise time.

The fall time in this case may be calculated in an alternate way which 
permits one to use the results in Section IV. The accumulation of cur- 
rent in the magnetizing inductance acts like a slowly varying bias that 
slides the load line toward the point of tangency, at which point rapid 
regeneration takes place. This fact suggests that the mode of operation 
now is very similar to the case in which a slow trigger signal is applied 
to the circuit. Hence, if one defines the fall time as the time taken for 
the response to traverse 90 to 10 per cent of the interval between the 
stable and quasistable operating points, one should expect that the fall 
time can be obtained directly from Fig. 15(a) or (46), provided the $\omega_0$
valid in this interval is used. Indeed, comparisons of rise and fall times
between Figs. 15(a) and 19 bear this out. 


8.2 External “Turn Off” 


When the blocking oscillator is turned off by an external signal the
natural pulse width is always selected to be much larger than the interval
between the “Turn On” and the “Turn Off” pulses, in order to secure
reliable operation. The magnetizing inductance will therefore not become
significantly charged during the time the circuit is on, and conditions
similar to those during the “Transition On” interval exist. By using
the value of $\omega_0$ that takes into account the effect of minority carrier
storage, the fall time in this case may be calculated from the results in
Section IV.


IX. THE “RECOVERY” INTERVAL

This interval is defined as the difference in time between cutoff and
the time the circuit comes to rest at the stable operating point. During
this interval, assuming that the recovery is overdamped or slightly
oscillatory, it is reasonable to assume that the transistor is essentially
cut off. This means that the $\beta$ of the transistor is zero, f(x) is essentially
zero and $C_0$ reduces to


$C_0 = C_c + C_d$. (69)


In addition, the terms involving the normalized base voltage will be 
considerably larger than those containing the normalized base current 
in the complete equation governing the circuit operation. Since the terms 
containing derivatives of the normalized base voltage are unimportant 
in the preceding operating intervals, their coefficients can be modified in 
terms of the values they assume during recovery without affecting per- 
formance in the other intervals. In this case, we do not operate on the 
base voltage terms with the expression 


(1 + I “yi T.(%), (70) 
wo dt 


since the transistor, except for its output capacity $C_c$, is effectively
disconnected from the output circuit while it rings out. In addition, the
parameter e, which arises from the capacity across the transformer, is
lumped in with the constant d, to give a value of d modified to


di ot (c. 40) (: ft i: ) 4 cr]. (71) 
n? Lm 


Finally, derivatives of v higher than the second were neglected. With
this reasoning, the terms that remain involving $V(x)$ in the normalized
equation of the blocking oscillator are


(a;’ + be) A - (b+ d,) + (1 } 


dt’ dt 


where 
with Co C. - Ca: 


With this change, the equation governing the performance of the blocking
oscillator over its complete cycle of operation was simulated on an
analog computer to verify the contention that the transistor is cut off
during recovery, and to give quantitative substance to the assumption
that only the $V(x)$ terms need be considered. Again, the analytical
function was used to represent the $i_c$-$i_b$ characteristic, while the normalized
base voltage function was approximated by two straight line segments.
The circuit was turned on by a Gaussian trigger pulse of normalized width
4.175 and magnitude G = 0.5, and left to its own devices to turn off.
The computer results when the constant $d_1$ was varied and the remaining
constants fixed are shown in Fig. 22. The main effect of varying $d_1$
is to change the response after the blocking oscillator has turned off.
Recovery is rather slow, and decreases as d is increased. It should be
noted that, as long as d is not too large, the overshoot of the base
current is small, confirming the assumption that the transistor remains
essentially cut off during recovery. The undershoot of the output voltage
is rather large, becoming larger and narrower as $d_1$ is increased. This is
largely due to the fact that the transformer capacity has been assumed
negligible compared to $C_0$ (i.e., e = 0). When e is greater than C4, it
will be found that the undershoot of the output voltage is quite small.
This will be discussed more fully in a succeeding paragraph. From Fig.
19 it can be seen that recovery is very much dependent on the parameter
c. This would be expected physically since c is associated with the
energy stored in the magnetizing inductance.

From the above results it can be seen that the transistor is essentially
cut off during recovery and that the base-emitter junction of the
transistor is back-biased, assuring that $V(x) \gg x$. Therefore, all the
assumptions of the beginning of this section are valid and the equation
governing recovery can be written from the Appendix and (72) as


(a; + bie) 7 + (b, + dt e) ay + ( - 
dr? dr 


ot (74) 
He | 


dr dr? 


V dr + m(1r;) = (: + by : + ay : ) o(+). 


If we differentiate (74) we get, after some rearranging, 

‘ PV vy iV . 
c(a;? + be) —— + c(b, + d + ec) —— + (bh te) — +} 
dr’ l lr 


aT ( 


d ad , a 
. + 5, —— + _ (7). 
(¢ "dy? . S) " 


Whether the response of (75) is monotonic or not depends upon the
initial conditions when the recovery interval is entered, as well as the
parameters a through e. In the absence of information concerning the
initial conditions, we can only determine whether or not the response
will be nonoscillatory. This can be determined by using the discriminant
for the cubic on the homogeneous equation. Instead, we assume that
the third derivative term can be neglected and deal with the second-order
equation in V. This approximation appears to be justified in practice.
In this case, it is readily shown that the recovery response will be
nonoscillatory if

(b +c)” (e-—b) ec 


d+es te — >) = ic = + (76) 


A 


In unnormalized form, this becomes 





n? Lim 
Tn i 
RAC, + C,) + 2 | LnCr + Ln(Ce + (1 + ) - 
4m (44) 
Q nv ae 1/2 
2\C. + C, + Cr , 
During this interval, the output voltage is given by 
dV sy ' 5 oe 
y rte 2 + V dr + V + m(7;) — g(r). (78) 
aT C vr 


i 
When the transformer capacity (directly proportional to e) is much
larger than $C_c + C_d$, and the derivative terms of $g(\tau)$ and the highest
derivative of V are neglected, (74) becomes


dV 


, eae = 
+ VY + | V dr + m(r;) — g(r) = 0. (79) 
dr C vr 


Comparing (79) and (78), it can be seen that they are identical when
$x = 0 = y$. During recovery, x is small compared to V. The fact that
$y = 0$ implies that the output voltage is essentially zero during recovery
when the transformer capacity $C_T \gg C_c + C_d$. This must be so from
physical considerations, since the current through the load must of
necessity be small when the output impedance of the transistor becomes
quite high.


X. MULTIPULSE RESPONSE OF THE BLOCKING OSCILLATOR 


To investigate how the blocking oscillator should be designed in order
to work reliably as a regenerator in a PCM repeater, (116) of Appendix B
was solved on an analog computer with $g(\tau)$ consisting of a sequence of
alternating positive and negative Gaussian pulses. Each “Turn On” and
“Turn Off” trigger pulse was spaced 15 time units apart, and the magnitude
and normalized width of these were G = 0.5 and 4.175, respectively. The
analytical expression (37) was again used to represent the $i_c$-$i_b$
characteristic and $V(x)$ was approximated by two straight line segments. The
constants a and b were selected such that the rise time of the circuit was small
compared to the spacing between the trigger pulses and d was picked
such that the recovery was reasonably fast. Finally, it was assumed
that the blocking oscillator was initially at rest at the stable operating
point so that m(0) = 0.

Fig. 23 — Response of the blocking oscillator to a sequence of “Turn On” and
“Turn Off” pulses when $a^2 = 0.2$, b = 0.8, c = 5 and d = 0.28. This value of c
corresponds to a natural pulse width of 10. Each “Turn On” and “Turn Off”
pulse was spaced 15 units apart and had a magnitude and width of G = 0.5 and
4.175. The maximum of the first “On” pulse occurs when $\tau = 7.5$.

The results are shown in Figs. 23 through 26 for various values of
the constant c; that is, the natural pulse width of the circuit was altered
in these cases. In Fig. 23 the pulse width was equal to 10, and therefore
it was less than half the period of the trigger signal. It is seen that, not
only do the height and width of the output pulses vary considerably,
but the timing information contained in the trigger signal is almost
completely destroyed as well. In Fig. 24, in which the natural width
was almost doubled, conditions have improved considerably. However,
variations in width and timing are still intolerable. By increasing the
width further by almost 40 per cent, as shown in Fig. 25, the result
becomes acceptable, although the undershoot of y varies considerably.
Finally, in Fig. 26, where the natural pulse width is about 1.4 times the
period of the “Turn On” trigger signal, the train of output pulses is almost
completely uniform, with scarcely any change from one pulse to another.
Hence, for the blocking oscillator to perform reliably as a regenerator,
its natural pulse width should be at least equal to 1.5 times the repetition
period or, equivalently, three times the time interval, $T_T$, between
each “Turn On” and “Turn Off” pulse for a 50 per cent duty cycle;
that is,


$T_w \geq 3T_T$. (80)


In addition, the rise time should, of course, be considerably less than
$T_T$ and the recovery should be such that it is overdamped or only slightly
oscillatory.

Fig. 24 — Response of the blocking oscillator to a sequence of “Turn On” and
“Turn Off” pulses when c = 10.

Fig. 25 — Response of the blocking oscillator to a sequence of “Turn On” and
“Turn Off” pulses when c = 14.3.

Fig. 26 — Response of the blocking oscillator to a sequence of 13 “Turn On”
and “Turn Off” pulses when c = 25. This value of c corresponds to a natural pulse
width for the circuit of 41.5.


Physically, the above results are what one should qualitatively expect. 
When the natural width is about equal to or less than $T_T$, the turnoff
mechanism in the blocking oscillator itself will interfere with the trigger 
signal, while in the case of a large width the circuit works more or less 
like a bi-stable device, thus reproducing the incoming signal faithfully. 
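
The guideline of (80) can be checked numerically against the normalized analog-computer cases quoted in this section. The following is a minimal sketch, not part of the original analysis; the function names are illustrative, and the only inputs are the 15-unit pulse spacing and the natural pulse widths stated for Figs. 23 and 26.

```python
def on_off_spacing(rep_rate_hz: float, duty_cycle: float = 0.5) -> float:
    """Time between a "Turn On" pulse and the following "Turn Off" pulse."""
    return duty_cycle / rep_rate_hz


def width_to_spacing_ratio(natural_width: float, spacing: float) -> float:
    """Natural pulse width expressed in units of the on/off spacing."""
    return natural_width / spacing


def satisfies_rule_80(natural_width: float, spacing: float,
                      factor: float = 3.0) -> bool:
    """True when the natural width is at least `factor` times the spacing."""
    return natural_width >= factor * spacing


# Normalized cases from this section: trigger pulses spaced 15 units apart.
SPACING = 15.0
for label, width in (("Fig. 23", 10.0), ("Fig. 26", 41.5)):
    ratio = width_to_spacing_ratio(width, SPACING)
    print(label, round(ratio, 2), satisfies_rule_80(width, SPACING))
```

With these numbers the case of Fig. 26 gives a ratio of about 2.8, slightly under the rounded factor of three, which agrees with the statement that its natural pulse width is about 1.4 times the trigger period.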


XI. DESIGN EXAMPLE 


To illustrate how the results of the foregoing discussion may be used 
to develop a design procedure for the blocking oscillator, let us consider 
the following typical problem. 

A grounded emitter blocking oscillator is to be used as a pulse regenerator
working at a maximum repetition rate of 1.5 mc. It should be
triggered “on” and “off” by a circuit having an output impedance of
$R_b = 1000$ ohms and it must work into a load of $R_L = 400$ ohms. The
available bias voltages for the base and collector circuits are $E_{bb} = -0.8$
volt and $E_{cc} = -6$ volts, respectively, and the stray and wiring capacity
was estimated to be about 5 micromicrofarads. Available transformers
for this application have parameters referred to the primary in the
neighborhood of the following:


Leakage inductance: $L_e = 1\ \mu$h,
Magnetizing inductance: $L_m = 100\ \mu$h,
Stray capacitance: $C_T = 20\ \mu\mu$f,
Possible turns ratios: n = 1.5, 3, 4.5.


A surface barrier transistor, 2N128, is to be used, for which typical
$I_c(i_b)$ and $[V_b(i_b) - E_{bb}]/R_b$ characteristics are shown in Fig. 27(a). Its
other specifications are as follows:


Collector saturation current with $R_L$ = 400-ohm load: $I_{cs}$ = 14.5 ma,
Minimum base current to saturate transistor: $I_{bs\,min}$ = 2 ma,
Base voltage at the point of saturation: 0.5 volt,
Base input impedance in the conduction region: $r_{bs}$ = 60 ohms,
$\alpha$ cutoff frequency: $f_\alpha$ = 60 mc,
Collector capacitance: $C_c$ = 1.6 $\mu\mu$f,
$\beta_{max}$ = 37.


The “Turn On” and “Turn Off” trigger pulses have roughly a Gaus- 
sian shape and are spaced 0.33 microsecond apart. Their 50 per cent 
width is about 0.2 microsecond and their magnitude should be deter- 
mined. The rise time and fall time of the blocking oscillator should be 
less than 0.15 microsecond. 

The first step to be taken in designing the blocking oscillator is to 
determine what turns ratio should be used. However, the turns ratio 
affects a number of things, such as the location of operating points, 
natural pulse width, etc., in the circuit, so that no single criterion is
sufficient to determine n. The range of values imposed upon n by the 
various requirements in the circuit must be determined first, then a turns 
ratio can be chosen compatible with all these requirements: 

i. The first of these requirements comes from the fact that the turns
ratio must be such that the load line intersects the $I(i_b)$ curve in three
points. This restricts n to values:


ye lt in en OB ie tans 
ea min ~~ Ib. Ibs min fe 
where $I_{bs\,min}$ is the minimum base current that will saturate the
transistor.
ii. The second requirement pertains to the desirability that n is such 
that the magnetizing inductance does not significantly affect the location 
of the quasistable operating point. According to inequality (27), this 


n? m . 4. R, ( L 4 rcs) <«K 0. 


requires that: 


(82) 


Vow — Ew Lim \w 


R,
Fig. 27 — Design example: (a) $[V_b(i_b) - E_{bb}]/R_b$ and $I_c(i_b)$ characteristics of
2N128 transistor; (b) range of values imposed upon n; (c) $I(i_b)$ for n = 3.


Substituting for $\omega_0$ and $C_0$ from (28) and (15), respectively, this condition
may also be written in the form:


2 | l l 
*—- RB} = - IRC, 
5 t F bs ~~ Ky | Fe (1 T ' ) ¥ 


+ fh | l 4 RUC, + Lc) | <0. 


Lim Wa 
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However, in most cases of practical interest, 


1] 1 ' ati | 

7 | + RAC, + 150.) |< , “9 . (84) 

| Wa Vis ra Ew 

Thus, inequality (83) reduces to 
i. “a 
0 = 
adi Vo = Ey” (85) 

R 


In our numerical example, condition (84) leads to 5.9 * 10° « 1.11. 
So as far as this requirement is concerned, n should be selected within 
the range: 


<n <TR (86) 


iii. In order to assure that the rise time is reasonably independent of
$\beta_{max}$ of the transistor, n should, according to Fig. 13, be selected such
that $\beta_{max}/n$ is not too close to unity, say,


$n \leq 24.6$. (87)


iv. The fourth requirement on n concerns the desirability
of a nonoscillatory recovery, which, according to (77), requires that:
. ‘ P , 

/ 2R, / 2a “10° 5 


/ 


n = 1.00. a 
fo 4 (S88 ) 


ay + V am 


v. Finally, in the section on the multipulse response of the blocking
oscillator it was found that the natural width should be equal to or
larger than three times the time interval, $T_T$, between the “Turn On”
and “Turn Off” pulses in order for the circuit to reproduce the incoming
information faithfully. According to (80), this requires that n must be
such that:





| Ep 
nL n Se 
— in owe > BT’... (89) 
Tbs I); Ew 

hes Lei lbs 


From Figs. 20 or 21 it is seen that this condition will limit n to a range
of values between a minimum and a maximum. All quantities are known
in the above equation except $I_{bj}$, which, if n were known, could be
determined from Fig. 27(a). However, since the lower and upper limits of n
defined by the above condition approach each other when $I_{bj}$ is increased,
and since $I_{bj}$ increases as n is decreased, we only have to consider the
case where $I_{bj}$ is determined by the lowest possible value of n. Examining
the previous restrictions on n, we see that its smallest possible value is
1.05, which determines the point of tangency in Fig. 27(a) to be $I_{bj} = 2$.
Hence, condition (89) becomes





l 
| 
2, 7 og ‘a (90) 
n In —— 2 0.564, 
1.138 
which gives: 
0.996 <n < 6.53. (91) 


The range of values imposed upon n by the above requirements is
plotted in Fig. 27(b). It is seen that any value of n between 1.05 and
6.53 satisfies all these requirements. However, remembering that the
turns ratio should be selected small in order to reduce the rise time as
much as possible, and taking into account that, for reasons of reliability,
n should not be chosen close to the edges of the allowable range, the
value n = 3 seems to be a good compromise. With this value of n the
resulting $I(i_b)$ curve is as shown in Fig. 27(c), from which it is seen
that the base current at the quasistable operating point is $I_{bs}$ = 3.42 ma.
Hence, the various circuit parameters become:


276.08 . « , 
a. ae 0008 X 10 = 73 X 10, 
Fes 14.5 (92) 
1+ D> 
Ibs 3.42 


‘ . wen 4.5 ——— 
if ia Ca + Loc. (1 + *) = o& + 1.24 1.6) (1 + : 3) - 17.6uyf, 
bs o.tse 


a = wVL.Co = 7.3 X 10'4/10-*(17.8 & 10-2) 


b = why = (7.3 X 10’)(1.76 X 10") = 0.52, 
b 


O.S84. 
2a 
The normalized trigger pulse width is
$(7.3 \times 10^7)(2 \times 10^{-7}) = 14.6$. (94)


From Fig. 15(c) it is seen that with these values of a and b the normalized
rise time is about 4.6, which means that the real rise time is:


$\frac{4.6}{7.3 \times 10^7} = 0.063\ \mu$s. (95)


Also, from Fig. 16(b) it can be concluded that the overshoot is com- 
pletely negligible in this case. This is, of course, to be expected, since 
the ratio b/2a is close to unity. 

By separate measurements on the transistor it was determined that
the equivalent $\omega_0$ during turnoff was equal to $3 \times 10^7$, giving, as before,
a = 0.13 and b = 0.21. According to Fig. 15(c), the fall time is therefore


$\frac{3.8}{3 \times 10^7} = 0.127\ \mu$s. (96)
Hence, both the rise and fall times are well within the specified limit. 
If this had not been the case, it would have been necessary to repeat the 
above procedure with a different trigger source, transformer or load, or 
with a different transistor. 
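
The arithmetic behind the normalized constants and the rise and fall times can be reproduced with a short script. This is a sketch under stated assumptions: the forms a = ω0·sqrt(Le·C0) and b = ω0·R·C0, with R taken as the 400-ohm load, are inferred from the quoted numbers (they reproduce a = 0.13, b = 0.21 at turnoff and b/2a = 0.84 at turn-on) and are not copied from a legible equation. The variable names are illustrative.

```python
import math

L_E = 1e-6        # leakage inductance, henries
C_0 = 17.6e-12    # total capacitance quoted in the example, farads
R_LOAD = 400.0    # load resistance, ohms (assumed to be the R entering b)
SPEC = 0.15e-6    # required maximum rise and fall time, seconds


def normalized_constants(omega_0):
    """Return (a, b) for a given omega_0, using the assumed definitions."""
    a = omega_0 * math.sqrt(L_E * C_0)   # assumed: a = w0 * sqrt(Le * C0)
    b = omega_0 * R_LOAD * C_0           # assumed: b = w0 * R * C0
    return a, b


def denormalize(tau, omega_0):
    """Convert a normalized time tau = w0 * t back to seconds."""
    return tau / omega_0


a_on, b_on = normalized_constants(7.3e7)    # turn-on value of omega_0
a_off, b_off = normalized_constants(3.0e7)  # turnoff value of omega_0
rise = denormalize(4.6, 7.3e7)              # normalized rise time read from Fig. 15(c)
fall = denormalize(3.8, 3.0e7)              # normalized fall time read from Fig. 15(c)

print(round(a_on, 2), round(b_on, 2), round(b_on / (2 * a_on), 2))
print(round(a_off, 2), round(b_off, 2))
print(rise < SPEC, fall < SPEC, round(fall / rise, 1))
```

Under these assumptions the turn-on value of b comes out near 0.51 rather than the quoted 0.52, a difference of the order of slide-rule rounding; the fall time is about twice the rise time, at the lower end of the two-to-ten range mentioned for internal turnoff.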

From Fig. 27(c) it is seen that the minimum trigger magnitude in
the case of a very slow trigger is $I_{t\,min}$ = 0.85 and, by interpolation
between Figs. 7 and 8, the normalized magnitude for a normalized width of
about 15 is G = 0.175. Hence, the magnitude of the applied trigger should be
larger than:


$I_t > (I_{bs} - I_{bc})(G - G_{min}) + I_{t\,min}$
$\approx 3.42(0.175 - 0.150) + 0.85$ (97)
~ 0.95 ma. 
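
The numbers in (97) can be evaluated directly. In the sketch below the argument names are illustrative labels for the four quantities that appear in the numerical line of (97); the bias current at the stable point is assumed negligible, as the text indicates for the cutoff region.

```python
def required_trigger_ma(i_bs_ma, g_fast, g_slow, i_slow_min_ma, i_bc_ma=0.0):
    """Scale the normalized increase in trigger magnitude to milliamperes
    and add it to the slow-trigger minimum read from the static curve."""
    return (i_bs_ma - i_bc_ma) * (g_fast - g_slow) + i_slow_min_ma


# Values quoted in the design example (currents in ma).
i_t = required_trigger_ma(i_bs_ma=3.42, g_fast=0.175, g_slow=0.150,
                          i_slow_min_ma=0.85)
print(f"{i_t:.4f} ma")
```

Evaluated exactly, the expression gives about 0.94 ma, which the text rounds to roughly 0.95 ma.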


If the blocking oscillator had only been turned “on” externally but 
left to its own devices afterwards, its natural pulse width would be: 


l 
2 4 + I 
a o Mme 3 =e 
ie = 7 In —_—_—__ 3.35uS, (98 ) 
506.4 1.25 
= + |] 
14.5 


where $I_{bj}$ was found from Fig. 27(c) to be equal to 1.25 ma.

The undershoot of the output voltage will be very small since the 
transformer capacitance is much larger than the output capacitance of 
the transistor during the recovery interval. 

Fig. 28 — (a) through (d) Output voltage of the blocking oscillator in response
to various types of trigger signals; (e) the complete response.

To check how well the above calculations and the rise times predicted
by Fig. 15 agreed with experimental results, a blocking oscillator having
the same specifications as the one above was built and measured. In the
case of the rise times, trigger pulses having magnitudes and widths
corresponding exactly to those of Fig. 15 were applied to the circuit and
the measured rise times were then compared to the values predicted by these
figures. The results are shown in Figs. 28(a) through (d) and in Table III.
In all cases the overshoot was not observable, which is what one should
expect from Fig. 16.
Also, the critical triggering level in the case when a 0.2-microsecond pulse
was applied was found to be equal to $I_{t\,min}$ = 0.9 ma. Fig. 28(e) shows
the complete response of the blocking oscillator, from which it is seen
that its natural pulse width was about 2.5 microseconds and the under-
shoot less than 5 per cent. 
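
The agreement between calculation and measurement can be summarized by relative discrepancy. The sketch below uses only values quoted in this design example and in Table III; pairing the calculated trigger level of about 0.95 ma with the measured critical level of 0.9 ma is an editorial choice made for illustration.

```python
# (quantity, calculated, measured) as quoted in the design example.
COMPARISONS = [
    ("rise time, slow trigger (us)", 0.157, 0.15),
    ("rise time, moderately fast trigger (us)", 0.063, 0.06),
    ("rise time, short trigger (us)", 0.064, 0.06),
    ("rise time, saturating step (us)", 0.031, 0.03),
    ("minimum trigger magnitude (ma)", 0.95, 0.9),
    ("natural pulse width (us)", 3.35, 2.5),
]


def relative_discrepancy(calculated, measured):
    """Signed difference between calculation and measurement, relative to the measurement."""
    return (calculated - measured) / measured


def rank_by_discrepancy(rows):
    """Sort quantities from largest to smallest absolute relative discrepancy."""
    scored = [(name, relative_discrepancy(c, m)) for name, c, m in rows]
    return sorted(scored, key=lambda item: abs(item[1]), reverse=True)


ranking = rank_by_discrepancy(COMPARISONS)
for name, rel in ranking:
    print(f"{name}: {rel:+.1%}")
```

On these figures the rise times and trigger level agree to within a few per cent, while the natural pulse width differs by roughly a third.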


TABLE III

                                      Actual       Actual    Rise Time, T_R, in µs
Trigger Signal                        Magnitude,   Width,    ---------------------------------------
                                      ma           µs        Calculated                     Measured

Slow Trigger Pulse                    1.3          2.3       11.5/(7.3 × 10^7) = 0.157      0.15
  G = 0.3, width = 166.7 [Fig. 14(a)]
Moderately Fast Trigger Pulse         [illegible]  0.34      4.6/(7.3 × 10^7) = 0.063       0.06
  G = 0.5, width = 25 [Fig. 14(c)]
Short Trigger Pulse                   3.           0.06      4.65/(7.3 × 10^7) = 0.064      0.06
  G = 1, width = 4.175 [Fig. 14(e)]
Saturating Step Function              3.                     2.3/(7.3 × 10^7) = 0.031       0.03
  G = 1 [Fig. 14(f)]


XII. DISCUSSION OF RESULTS


An approach to the design of transistor blocking oscillators with
emphasis on the nonlinearities inherent in their operation has been
presented. A graphical picture of the operating points of the circuit has been
drawn which gives a useful feel for the circuit behavior. In addition, 
both the minimum trigger signal needed to cause the circuit to regener- 
ate, and the requirements for reliable triggering can be inferred from 
the static characteristic (steady-state equation). An approximate rela- 
tionship between trigger magnitude and width was determined from 
analog computations. This relationship, combined with other analysis, 
shows that the minimum energy for reliable triggering is attained when 
the transistor has a high low-current $\beta$, large $\omega_\alpha$, small collector capacity,
and low forward drop (a good switch) and the transformer is of high
quality, i.e., has large magnetizing inductance and high coupling
coefficients. This is hardly surprising, but it satisfies our intuition. Most im-
portantly, an indication is given as to how these factors affect trigger 
sensitivity. 

The rise time of the circuit is relatively independent of the maximum
loop gain $\beta_{max}/n$, and therefore also of the details of the nonlinear $i_c$-$i_b$
characteristic for trigger signals about 50 per cent or more larger than
the minimum. On the other hand, the rise time is very much dependent 
upon the rise time of the trigger pulse. Slowly varying triggers give, as 
expected, the poorest rise time, while the minimum rise time is achieved 
with fast trigger signals that saturate the transistor immediately. The 
implications of this result are obvious: power gain and considerable pulse 
reshaping, as required in a regenerative repeater for PCM, can be pur- 
chased at the expense of a rise time significantly slower than that at- 
tainable with a sharp trigger signal. This follows from the fact that the 
positive feedback loop, which is active over a greater portion of the tran- 
sition on interval with slow trigger signals, has a deleterious effect on rise 
time. The results of the rise time calculations and the conditions for
monotonic response permit the designer to establish specifications on
the circuit to obtain the desired leading edge of the output pulse with 
small overshoot. The analysis of the “On” interval shows clearly the 
instabilities in pulse width that result when this width is determined by 
the natural pulse width of the circuit. It can be seen from Figs. 20 and 
21 that the natural pulse width is critically dependent upon the circuit 
operating points, the nonlinearities and the bias. This is further empha- 
sized by the fact that the largest discrepancy between measured and 
computed results in the example occurs in the natural pulse width. When 
the pulse width must be accurately controlled, it is highly desirable to 
have this accomplished by an external trigger-off pulse. This implies a 
large magnetizing inductance for the feedback transformer and justifies 
the initial approximations that were made in the derivation of the equa- 
tion governing the transition on interval. 

During recovery the energy stored in the magnetizing inductance rings 
out. Quick recovery can be accomplished without retriggering if the 
transformer circuit is critically damped. This may be accomplished with 
the existing circuit elements, or it may be necessary to add a damping 
resistor and diode combination across the transformer.’ This has the 
effect of modifying the value of R, used in the recovery interval. When 
the circuit is clocked off, analog computer results show that the natural 
pulse width should be at least three times the desired pulse width for 
reliable performance. 

Several points regarding this analysis and its application should be 
clarified before concluding. As pointed out, the equivalent circuit chosen 
for the transistor is a necessity for a tractable analysis. Gross deviations 
from the assumed configuration may partially invalidate some of the 
results. In most cases, physical reasoning can be used to account for the 
effects of these deviations. Furthermore, this equivalent circuit is a good 
one for transistor designs to shoot for. 

Second, the design example clearly shows that the analysis presented 
does not lead to a set of design equations that can be solved immediately 
for the values of all the circuit elements. In particular, the parameter n, 
the turns ratio, is woven into all of the normalized design parameters, as 
evidenced by the design example. Furthermore, both the magnetizing 
inductance of the transformer and its parasitic capacity are dependent 
on n. The fact that there is no simple, immediate means of arriving at a 
unique set of parameters is due to this interdependence. This is typical 
of most engineering problems. However, this analysis does bring out the 
compromises required in the design. 

Finally, we believe that the ingredients employed in the present
approach are useful in the successful execution of nonlinear circuit design.
The recipe is as follows: apply some experiment; some intuition; some 
analysis, including minor deviations from superposition, and, finally, a 
healthy sprinkling of planned computing. 
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APPENDIX A 


Derivation of the Differential Equation 


From the equivalent circuit in Fig. 2(b) the equations governing 
blocking oscillator behavior are: 


I,(t), (99) 


ae Vi(%), ( 100) 


= I .(ib), (101) 


(102) 


(103) 
C, dv, 
n dt’ 
lv, 


— } } = 4 a ( 104) 
1 l. = 71 ( 0 9 
, “ , dt 


+ I m(t;) + 


; », — En ‘ 
— 27k, — : _ = L ; (105) 
n dt 


Substituting for v. from (105) in (104): 
di; Co doy di, 


} Cok. - LCo : O06 
1+ fi dt t n dt . dt (106) 
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and combining 102 and 103: 


. oe Ew, P = Kw 
nly + 7 ; dt + In(t;) 
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1, = 
? / -_ 
(107) 

Cr du, 

n dt 


Substituting for 7, from (107) into (106) and combining terms yields:

Operating upon (108) in accordance with (101), grouping like terms,
dividing through by n and rewriting in terms of $[V_b(i_b) - E_{bb}]/R_b$ gives
the final form of the defining equation as:


LC o diy R.Co\ di l diy, 
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In the above, /(2,) is defined as 
(; Mm rcs) Vz! ea Ey (110) 
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The “steady-state equation” which determines the operating points 
of the circuit is given by (109) when all derivatives and integrals are 
zero. It is 


I(%) — & — ——~— =0. (111) 
n 


From (107), the output voltage becomes: 


Vol ty) — Bw Ry —_— — Ky di 


tp n° 2D R, 


vo = Ra, = nR. E + 


(112) 


“hy T d V mA 1p ) eid Ew I m ( t ) 
R, - pe) ae ECE) I, 
ie R, ie : | 


Combining (105) and (112), the collector voltage may be written 


. Vil) — Be L,. d : 
= E., = _ (11: 
Bee n (1 TR. $V 0. 113) 


APPENDIX B 


Normalization 
Considerable simplification and savings in space can be achieved when 
the dependent and independent variables are normalized in the following 
manner: 
wot, 
te — Ih. 
Tye — Tee 
Vil) — Ew 
Ri(Les = Tc) 
I(t) 
Tus — Lec 
I .(%) 
n(L, — Loe) 


= normalized base current, 


normalized base voltage, 


= normalized trigger function, 


normalized collector current base current function, 


Egtt;) NI», 
n(1bs we Tec) 


I(%) 
D vc 


= normalized initial current in L»,, 


= normalized nonlinear characteristics.

Further, we define
wo L ef "0 ’ 
wol? .C ‘0 ’ 


nLim 
| ih
n? Lim
Rk

Using the above definitions in (109) through (111) of Appendix A gives
the normalized defining equations: 


3. ee. ’
+ (a + be + a’e)


=}1+ (1 +5) ¢ + (a +b) © +i 3 g(r) 
dr dr dr* 


4 
h(a) = f(x) - (1 aes ’) Via). (117) 


e 

The normalized “steady-state equation” which determines the operat- 
ing points of the circuit is given by (22) in Section IV when all deriva- 
tives and integrals are zero. It is 


l 
h(x) — x — m(r;) = f(x) - (1 + “a ) V(x) — x — m(r;) = 0. (118) 
r 


In terms of the normalized base current, x, the stable operating point is
now located at x = 0 and the quasistable one at x = 1, and f(x) attains
the same values at these two points.
If the normalized output voltage is defined as 


Vo 1) 
NR(In, — Ive) n(lIb, — } 
the normalized version of (112) of Appendix A becomes 


dV (2x) 


+ m(r;) — g(r). (119) 
dr , 


xt Viz) + | Vix) dr + ¢ 


C dr 


In a similar way, the normalized collector voltage is defined as 


K.. — v, 
MR(Lb, — Ive)’ 


and (113) of Appendix A becomes 
kV(a) + (: t - TY y. (120) 
») dr


Hall Effect Devices


By W. J. GRUBBS 


(Manuscript received October 21, 1958) 


A wealth of devices which depend on the Hall effect for their operation 
have been proposed in the last decade. This paper gives the results of a sur- 
vey of these devices. Original work in this field is included in those sections 
which describe the circulator, one-piece gyrator, switch, frequency spectrum 
analyzer, phase discriminator and digital-to-analog encoder. Semiconductor 
materials are discussed in terms of what type of material is most desirable and 
how currently available materials limit the usefulness of Hall effect devices. 


I. INTRODUCTION 


If a magnetic field is applied perpendicular to a current flow in any
conductor, the moving charges (which constitute the current) are deflected
sidewise and build up a potential difference between the two sides of the
conductor. The creation of this transverse electric field (perpendicular
to both the magnetic field and the original current flow) is called the
Hall effect. During recent years, interest in this effect has increased
tremendously. Before semiconductors and their capabilities were
understood, the Hall effect in solids was little more than a laboratory
curiosity. Now it is not only an important tool in metallurgy and
semiconductor device development, but it has been the mode of operation
of many proposed devices. This article describes how 20 or so of these
devices operate. In each case, the major advantages or disadvantages
are mentioned, but no attempt is made actually to determine the
usefulness of the device.

The devices to be discussed have been arbitrarily divided into two
groups: devices which use a constant magnetic field and devices in which
a signal or an oscillator produces at least a part of the magnetic field.
Such a division is not entirely arbitrary, because the first group
inherently has a very high limit on the operating frequency and the
second group has a considerably lower limit. The devices are listed
below.


Constant magnetic field


Gyrator
Isolator
Negative-resistance amplifier
Circulator


Signal-produced magnetic field


Switch
Transducer
Magnetic field meter
Electrical compass
Magnetic field variation meter
Ammeter
Wattmeter
Amplifier
Modulator (and nondrift dc amplifier)
Demodulator
Frequency spectrum analyzer
Phase discriminator
Digital-to-analog encoder
Analog multiplier


II. CONSTANT MAGNETIC FIELD DEVICES 


2.1 Gyrator 

The gyrator has received more attention¹,²,³,⁴,⁵ than has any other
Hall effect device, undoubtedly because it was the first nonreciprocal
circuit element to which the electrical world was exposed. Casimir⁶ was
the first to point out in an English publication that, in a conducting
solid, if $R_{ik}(H)$ is the transfer impedance from terminal pair $i$ to
pair $k$ in the presence of an orthogonal magnetic field $H$, then

$$R_{ik}(H) \neq R_{ki}(H),$$

but

$$R_{ik}(H) = R_{ki}(-H). \tag{1}$$

This is simply a statement of the fact that the presence of the magnetic
field causes the reciprocity theorem to be violated. With $H = 0$,

$$R_{ik} = R_{ki}.$$

Casimir credited Meixner⁷ with being the first to prove (1).

McMillan⁸ pointed out that many transducers (such as the crystal or
condenser types) are reciprocal, but that electrodynamic or magnetic
transducers are antireciprocal. An antireciprocal four-pole is a transducer
in which the transfer impedance from left to right is opposite in sign
and equal in magnitude to the transfer impedance from right to left.
McMillan suggested that by combining the two kinds of transducers
(reciprocal and antireciprocal) it should be possible to produce a
nonreciprocal transducer, one in which the two transfers have unequal
magnitude.

It may be noted that McMillan’s antireciprocal four-pole corresponds
to at least one definition of the gyrator and his nonreciprocal four-pole
bears a resemblance to an isolator. The gyrator may be defined as a
four-pole in which the two transfer impedances are equal in magnitude
but have phase angles which differ by 180° (i.e., a gyrator is an
antireciprocal four-pole).

In a letter to the editor, McMillan⁹ suggested the Hall effect as one
possible means of achieving a nonreciprocal electrical system. His
scheme was to place a slab of bismuth with contacts in a magnetic field,
as shown in Fig. 1. He selected bismuth because it exhibits a very large
Hall effect.


Fig. 1. McMillan’s antireciprocal four-pole (proposed).

Tellegen‘ referred to McMillan’s work and went on to give the gyrator 
its name,* propose in general terms possible means of making it and 
suggest many ways in which it would be helpful in electrical network 
synthesis. 

Hogan¹⁰ used the Faraday effect to make a gyrator at microwave
frequencies. Mason et al.² reported making gyrators which use the Hall
effect for their operation, and Wick® showed that the minimum possible 
power loss of a Hall effect gyrator is 7.66 db. 

* The gyrator is the electrical analog of a mechanical gyroscope.

The type of Hall effect gyrator which has been studied most thoroughly
consists of a square slab of semiconductor with an ohmic contact in the
middle of each edge face and a constant magnetic field H perpendicular
to the plane of the slab (see Fig. 2). There are no junctions and the
reader may readily convince himself that such a device is antireciprocal.
If the Hall angle is small, it is given by


$$\theta_h = \tan^{-1}\left(\mu_H H \times 10^{-8}\right), \tag{2}$$

where $\mu_H$ is the Hall mobility of the semiconductor’s majority carrier
in cm²/volt-second and $H$ is the magnetic field intensity in oersteds.
(This approximation is valid as long as the product of the majority
carrier mobility and density is much greater than the same product for
minority carriers.) It can be shown (again, for small $\theta_h$) that, in
the Hall effect gyrator with the output leads open-circuited, the output
voltage is

$$V_{out} = V_{in}\left(\mu_H H \times 10^{-8}\right). \tag{3}$$


An n-type germanium gyrator has a loss of about 14 db with H = 17,500
oersteds. If a material with higher mobility were used, the same loss
would be observed with a smaller field or a smaller loss with the same
field. Hogan¹⁰ refers to a vacuum tube Hall effect gyrator proposed by
R. O. Grisdale which had four electrodes that could both emit and collect
electrons. Such a device could have very high electron mobility. It was
built and found to have a loss of “about 7 db”. Apparently $\tan\theta_h$
became so high that the minimum possible loss was approached. Of course,
this is more complicated than most users prefer, so the device was not
developed for practical use.
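
As a rough numerical illustration of (2) and (3), the following sketch
evaluates the Hall angle and the open-circuit voltage ratio. The mobility
figure is an assumed, illustrative value for n-type germanium and is not
taken from this article; the function names are likewise hypothetical.

```python
import math

# Assumed illustrative Hall mobility for n-type germanium, cm^2/volt-second.
# This number is NOT from the article; substitute a measured value.
MU_H_GERMANIUM = 3900.0


def mobility_field_product(mu_h: float, h_oersted: float) -> float:
    """Dimensionless product mu_H * H * 1e-8 appearing in (2) and (3)."""
    return mu_h * h_oersted * 1e-8


def hall_angle_deg(mu_h: float, h_oersted: float) -> float:
    """Hall angle from (2), returned in degrees."""
    return math.degrees(math.atan(mobility_field_product(mu_h, h_oersted)))


def open_circuit_voltage_ratio(mu_h: float, h_oersted: float) -> float:
    """V_out / V_in from (3); only meaningful for a small Hall angle."""
    return mobility_field_product(mu_h, h_oersted)


if __name__ == "__main__":
    h = 17500.0  # oersteds, the field quoted for the germanium gyrator
    print("Hall angle (deg):", hall_angle_deg(MU_H_GERMANIUM, h))
    print("V_out / V_in    :", open_circuit_voltage_ratio(MU_H_GERMANIUM, h))
```

With the assumed mobility, the product at 17,500 oersteds is roughly 0.68,
which is no longer small, so (3) should be read only as a first estimate
at such fields.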

It is worth mentioning that a semiconductor Hall effect gyrator operates
at direct current and, theoretically, at any frequency up to the
dielectric relaxation frequency of the material used.


Fig. 2. Hall effect gyrator (built).

Fig. 3. Hall effect isolator made from gyrator.


An ideal gyrator is also an ideal impedance inverter, but the loss
inherent in the Hall effect gyrator makes it useless as an inverter. This
loss is of such a nature that it could be effectively removed by placing
negative resistance of the proper value in series with the gyrator. This
lossless or ideal gyrator should then find application in network
synthesis.

The gyrator has been discussed here in some detail not only because
of its historical interest but, primarily, because it has been the basis of
the major portion of the devices to be described in the remainder of this
article.


2.2 Isolator 


An isolator?*:* is a nonreciprocal four-pole in which one of the two 
transfer impedances is zero. Thus it transmits signals in one direction 
only — say, from terminal pair 1 to pair 2. Such a device can be made 
from a Hall effect gyrator by the addition of two shunting resistors (see 
Fig. 3). Assuming the gyrator is symmetrical, its two self impedances
may be called $z_s$ and its transfer impedances $z_{T1}$ and $z_{T2}$.
Furthermore,

$$z_{T1} = -z_{T2} = z_T > 0.$$


Then it can be shown that the parallel resistance values $R_1$ and $R_2$
must be such that

$$R_1 + R_2 = \frac{z_s^2 + z_T^2}{z_T}$$

in order to obtain an isolator.

Since $z_T/z_s = \tan\theta_h$, it may be seen that, with $R_1$ and $R_2$
fixed in value, to maintain isolation one must maintain a constant $H$, a
constant $\mu_H$ and, therefore, a constant temperature. Also, if the
resistance of the semiconductor contacts is current-sensitive, isolation
might be possible only in a limited range of signals. The forward loss of
such an isolator is just slightly less than that of the gyrator from which
it is made. The minimum possible forward loss is 6 db, and the loss in
experimental germanium isolators has been approximately 14 db. Thus, to
transmit 1 mw of power, one must apply 26 mw to the input because 25 mw
will be dissipated in the isolator, most of it in the germanium. This is
another source of current sensitivity: changing the current level changes
the amount of power dissipated in the isolator, changing its temperature
and therefore $z_s$, $z_T/z_s$, $R_1$ and $R_2$. The higher the reverse
loss is (i.e., the better the balance) the more sensitive the isolator
will be to all such variations. The reverse loss in germanium isolators
has been made 75 db, and this loss is quite sensitive to small variations.
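
The loss figures quoted for the isolator can be related to power ratios
with the usual decibel definition. The sketch below is only a convenience
for that arithmetic; the function names are hypothetical. A ratio of 26 to
1 corresponds to slightly more than 14 db, consistent with the
"approximately 14 db" quoted for germanium isolators.

```python
import math


def db_to_power_ratio(loss_db: float) -> float:
    """Input-to-output power ratio for a given loss in decibels."""
    return 10.0 ** (loss_db / 10.0)


def power_ratio_to_db(ratio: float) -> float:
    """Loss in decibels for a given input-to-output power ratio."""
    return 10.0 * math.log10(ratio)


def dissipated_power(output_power: float, loss_db: float) -> float:
    """Power absorbed in the device for a given delivered output power."""
    return output_power * (db_to_power_ratio(loss_db) - 1.0)


if __name__ == "__main__":
    for loss_db in (6.0, 14.0, 75.0):
        print(loss_db, "db ->", db_to_power_ratio(loss_db))
    print("26:1 in db:", power_ratio_to_db(26.0))
    print("mw dissipated per mw delivered at 14 db:", dissipated_power(1.0, 14.0))
```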

When indium antimonide (which has a much higher $\mu_H$) was used, the
forward loss was reduced to about 7.5 db, even with a smaller H. With
carefully adjusted shunt resistance values, a reverse loss of “the order
of 100 db” was obtained. Notice that the forward loss was very nearly
the minimum possible loss. Unfortunately, InSb has a quite small energy
gap and consequently a very high np product. This fact, in conjunction
with its very high mobility, inevitably gives rise to a very low resistivity.
Thus, the InSb isolator had an impedance of around 1 ohm. Germanium
isolators can easily be made with impedance levels ranging from 10 to
1000 ohms.

Typically, $\mu_H$ in InSb is a much stronger function of temperature than
is $\mu_H$ in germanium. Thus, this low-loss isolator has some serious
drawbacks.

There is another way of making a Hall effect isolator, and it has been
suggested that this second method should remove the temperature
sensitivity. Unfortunately, this is not the case. This form of the isolator
uses no resistors and has been called a “skew gyrator,” in which the
sample is square but the leads are not symmetrically spaced. Fig. 4 shows
the sample with the dotted equipotential lines which would occur with a
signal applied to side 1 if the magnetic field were the proper value.
Obviously, no voltage appears between leads 2 and 2’. If a signal is
applied to side 2, there will be an output at side 1. This is stated
symbolically as

$$1 \nrightarrow 2 \rightarrow 1.$$


Fig. 4. Hall effect isolator (or skew gyrator).

Fig. 5. Negative resistance amplifier.


This type of isolator is still sensitive to variations in $H$, $\mu_H$ and
temperature (and, therefore, current level). This sensitivity arises from
the need for a constant $\theta_h$ to maintain the slope of the
equipotential lines. Of course, if the semiconductor has the proper doping
level, so that $\mu_H$ is independent of temperature near room
temperature, this device could be quite stable. For that matter, most of
these devices could be stable with such a constant mobility material.

As for the shunt-resistor isolator, it is conceivable that resistors could 
be used that had the proper temperature coefficient for this isolator also 
to be made stable — perhaps even more stable than the skew gyrator. 


2.3 Negative-Resistance Amplifier 


The negative-resistance amplifier®" uses a slice of semiconductor 
with three equally spaced edge contacts, a perpendicular magnetic field 
of constant value H and three negative resistances (see Fig. 5). For the 
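moment, assume the parallel conductances are not in the circuit. The
short-circuit admittances may be defined as

$$y_{11} = \left.\frac{i_1}{v_1}\right|_{v_2=0}, \qquad
  y_{12} = \left.\frac{i_1}{v_2}\right|_{v_1=0},$$

$$y_{21} = \left.\frac{i_2}{v_1}\right|_{v_2=0}, \qquad
  y_{22} = \left.\frac{i_2}{v_2}\right|_{v_1=0}.$$

A parameter $\alpha$ (an odd function of $\theta_h$) may be such that

$$y_{12} = -\frac{y_{11}}{2}(1 - \alpha), \qquad
  y_{21} = -\frac{y_{11}}{2}(1 + \alpha), \qquad \alpha < 1.$$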


However, if the parallel admittances are connected and given the 
values 


Y= Ys = Yu, 


¥2 = 2, 


it can be shown that 7; = 0 and 72 = ayou, . Therefore, with the negative 
conductances in parallel, the device becomes an isolator which can have 
gain in the forward direction. One of its advantages is that it permits 
the construction of negative-resistance amplifiers which are unidirec- 
tional, so that higher gain with the same degree of stability is possible 
with this type of device than with the usual two-terminal negative re- 
sistance. 
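
The cancellation of one transfer admittance by a parallel negative
conductance can be checked with a small calculation. The sketch below is
an illustration under stated assumptions, not the circuit of Fig. 5: it
assumes the transfer admittances take the form $y_{12} = -(y_{11}/2)(1 -
\alpha)$ and $y_{21} = -(y_{11}/2)(1 + \alpha)$, treats the device as a
two-port with a common terminal, and models only a single admittance
bridged between terminals 1 and 2. All names are hypothetical.

```python
from typing import Tuple

Matrix2 = Tuple[Tuple[float, float], Tuple[float, float]]


def hall_two_port(y11: float, alpha: float) -> Matrix2:
    """Short-circuit admittance matrix under the assumed parametrization."""
    y12 = -0.5 * y11 * (1.0 - alpha)
    y21 = -0.5 * y11 * (1.0 + alpha)
    return ((y11, y12), (y21, y11))


def add_bridging_admittance(y: Matrix2, y_bridge: float) -> Matrix2:
    """Connect an admittance between terminal 1 and terminal 2.

    A bridging element adds +Y to both self admittances and -Y to both
    transfer admittances of a common-terminal two-port.
    """
    (y11, y12), (y21, y22) = y
    return ((y11 + y_bridge, y12 - y_bridge),
            (y21 - y_bridge, y22 + y_bridge))


if __name__ == "__main__":
    y11, alpha = 1.0, 0.5          # arbitrary illustrative values
    device = hall_two_port(y11, alpha)
    # Choosing the bridge equal to y12 (a negative conductance) nulls y12.
    balanced = add_bridging_admittance(device, device[0][1])
    print("y12 after balancing:", balanced[0][1])
    print("y21 after balancing:", balanced[1][0])
```

Under these assumptions the remaining transfer admittance has magnitude
$\alpha y_{11}$, so transmission survives in one direction only.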

Negative resistance obtained from gas tubes was used in this way to 
give a forward gain of 6 db and a reverse loss of 46 db. 

This type of device was proposed as a high-frequency amplifier 
because of the inherent insensitivity to frequency of Hall effect gyrators 
and related devices. Just how good they are at very high frequencies is a 
question which will be discussed a little later. 

The same principle may be applied to the shunt-resistor isolator. Still 
another form of negative resistance amplifier employing Hall effect 
will be mentioned in the next section. 


2.4 Circulator 


A circulator is a nonreciprocal n-port device (n > 2) in which a signal 
applied at one port is transmitted only to an adjacent port (e.g., the 
adjacent clockwise port). No output appears at any other port. A three- 
port circulator can be made which uses the Hall effect. As one may ob- 
serve from the skew gyrator, the Hall effect merely enables one to pre- 
vent any signal from appearing at one of the outputs. Therefore, if we 
had four or more ports, a portion of the signal would appear at all ports 
except one, and the device would not really be a circulator. See Fig. 6
for the symbol of a three-port circulator.


Fig. 6. General three-port circulator.


The Hall effect circulator’: consists of a circular slab of semicon- 
ductor (n-type germanium has been used to date) with six equally 
spaced edge contacts and a constant magnetic field H applied perpen- 
dicular to the plane of the slab. Such a sample is shown in Fig. 7 with 
equipotential lines dotted in. With the proper value of H, no output
appears at 3 but there is an output signal at 2. The results would be
similar for an input at 2 or 3 as well. So we may say

$$1 \rightarrow 2 \rightarrow 3 \rightarrow 1,$$
$$3 \nrightarrow 2 \nrightarrow 1 \nrightarrow 3.$$


Fig. 7. Hall effect circulator.


When a load is attached so that a current may flow at 2 (with the
signal applied at 1) this secondary current flow produces an additional
Hall electric field which has the net effect of reducing $\theta_h$. To
maintain circulation, H must be increased. Thus, it is obvious that the
circulator must be terminated in the proper impedances if its circulation
property is to be retained. At a particular magnetic field, $H_0$, the
load impedance necessary to “balance” the circulator is the same as the
circulator’s input impedance, so $H_0$ is the logical field to use to
avoid impedance mismatches.

When n-type germanium is used, $H_0 \approx 14{,}500$ oersteds, and the
matching load impedance, $Z_0$, is a function of the sample’s thickness
and resistivity, and might vary from 10 to 1000 ohms. If n-type InSb
were used, $H_0$ would be on the order of 1000 oersteds, but (as with the
isolator) $Z_0$ would have to be quite low, 1 ohm or even less.

It is almost impossible to make a circulator which has the proper 
impedance level and is symmetrical so that it will circulate a signal 
applied at any input. Fortunately there is a means of overcoming this 
difficulty so that a circulator of the type which is quite easily fabricated 
may be made to appear symmetrical and to operate at the proper imped- 
ance level. All that is involved is placing a network of six resistors in 
parallel with the circulator (see Fig. 8). Use Cp with H < Ho and C, 
with H > H,. Apparently the three-resistor networks shown in Fig. 9 
can be used equally effectively but, since Cp and C, have six variables, 
they should offer greater flexibility than does C,’ or either form of C,’. 

The parallel networks permit $Z_0$ to be adjusted to any value between
about one tenth its normal value and infinity (open-circuit), using
$H = H_0$ all the while. One can also use the normal value of $Z_0$ with
any H from zero to infinity. Another possible function of the networks is
to permit an approximately symmetrical circulator to be used with unequal
loads at the three ports.

The forward loss of a circulator is about 17 db if it is used without
parallel networks, and reverse losses of about 65 db have been achieved.
With n-type germanium, H must be about 14,500 oersteds. If InSb were
used, H would be only about 1000 oersteds. However, the forward loss
would still be 17 db, since the loss is determined by $\theta_h$, and
$\theta_h$ must be the same in any Hall effect circulator.


Fig. 8. Parallel networks for circulator.

Fig. 9. Alternative parallel networks.

On the other hand, if one uses network C, and increases H, the forward 
loss is reduced. With n-type germanium, it has been reduced to about 15 
db. Judging by the low loss achieved with an InSb isolator, one would 
expect that an InSb circulator could be made with a forward loss within 
one db or so of the minimum possible loss. The minimum loss has been 
calculated to be 8.4 db, so an InSb circulator probably could be made
with a 9- or 10-db forward loss. The reverse loss would depend solely on 
how precisely the resistance values of C, were adjusted. Of course, InSb 
still has the disadvantage of very low resistivity, and the impedance 
level would be about 1 ohm. As previously stated, such a circulator could 
be operated with any higher impedance load, but the necessary mismatch 
would increase the insertion loss tremendously. 

Perhaps another means of reducing the loss should be considered,
that is, using negative resistance in Cp or C,. By this means, the loss
could be removed, or gain could even be achieved, and one would have 
an amplifying circulator. Negative resistance is still hard to get, but if 
someone knew of a good means of producing it, he could make the cir- 
culator an even more interesting device. 


III. GENERAL REMARKS


All the devices described in the preceding pages could theoretically
transmit dc signals as well as ac signals of any frequency up to the
dielectric relaxation frequency of the semiconductor material. In order
to realize even a major portion of this huge bandwidth (~ thousands of
megacycles) special care must be taken to shield all input and output
leads properly. The samples should be small, so that only a small magnet
will be required. Consequently, all the leads must be fairly close to one
another where they attach to the sample. If the lead problem can be
overcome, the devices will remain resistive (and essentially insensitive
to frequency) until the relaxation frequency is approached. If leads are
treated carelessly, the top frequency may be less than one megacycle.


Fig. 10. One-piece gyrator.
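
The order of magnitude of the dielectric relaxation frequency can be
estimated from the resistivity and permittivity of the material. The
sketch below assumes the relaxation time is the product of resistivity
and permittivity and takes the relaxation frequency as $1/(2\pi\tau)$
(one common convention). The resistivity and the relative permittivity of
germanium used here are assumed illustrative values, not figures from
this article.

```python
import math

EPS0_F_PER_CM = 8.854e-14  # permittivity of free space, farads per cm


def relaxation_time_s(resistivity_ohm_cm: float, rel_permittivity: float) -> float:
    """Dielectric relaxation time tau = rho * eps_r * eps_0, in seconds."""
    return resistivity_ohm_cm * rel_permittivity * EPS0_F_PER_CM


def relaxation_frequency_hz(resistivity_ohm_cm: float, rel_permittivity: float) -> float:
    """Relaxation frequency taken as 1 / (2 * pi * tau), in cycles per second."""
    return 1.0 / (2.0 * math.pi * relaxation_time_s(resistivity_ohm_cm, rel_permittivity))


if __name__ == "__main__":
    rho = 10.0      # ohm-cm, assumed illustrative germanium resistivity
    eps_r = 16.0    # assumed relative permittivity of germanium
    f_hz = relaxation_frequency_hz(rho, eps_r)
    print("relaxation frequency, megacycles:", f_hz / 1e6)
```

For these assumed values the result is on the order of ten thousand
megacycles, in line with the "thousands of megacycles" quoted above.
Lower-resistivity material gives a proportionally higher figure.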

For linear operation, the contacts must be as nearly ohmic as possible 
and contact resistance must be minimized. Nonlinearity might be a 
serious problem with any of the above devices if a wide dynamic range 
of operation were required. 

If a good permanent magnet material were known (which also had a 
respectable Hall mobility) the above devices could be fabricated from a 
single piece of this material (see Fig. 10). The torus would be made of 
the unknown material, and it would be magnetized. The figure shows 
four contacts equally spaced about the torus’ body, so the pictured device 
would be a gyrator. It could also be an isolator or a circulator. Two ad- 
vantages of this structure would be that it would be all in one piece (for 
mechanical simplicity and rigidity) and that there would be no air-gap 
in the path of the magnet. 


IV. SIGNAL-PRODUCED MAGNETIC FIELD DEVICES 


4.1 Switch


Hall effect switches® for the most part are not simply single-pole 
single-throw switches. The simplest Hall effect switch might be termed
a double-pole single-throw switch. This might be a gyrator sample in
the gap of an electromagnet. Let us call the two terminal pairs of the
gyrator 1 and 2. When there is no magnet current flowing, 1 is not
connected to 2. But, with the magnet energized, 1 is connected to 2 and
the loss introduced by the switch “contacts” is the insertion loss of the
gyrator.

One can make a more intriguing switch by using an isolator instead
of a gyrator. With no magnetic field, this is not very interesting, because
1 and 2 are then merely reciprocally connected very inefficiently. But
with the proper value of H, 1 is connected to 2 but 2 is isolated from 1
($1 \rightarrow 2 \nrightarrow 1$). Reversing the field reverses the
direction of isolation. Furthermore, if the isolator is constructed from a
gyrator plus negative resistances, so that the isolator is a
unidirectional amplifier, switching the magnetic field changes the
direction of amplification. It is as though a broadband amplifier could be
physically turned around by energizing a coil in the opposite sense.

An even more elaborate switch involves switching a circulator. It was 
first proposed that a signal be applied to terminals 1 of a circulator and 
that it be switched between output 2 and output 3 by reversing the 
direction of H. However, when this is done, no less than three “isolator 
switches” are reversed at the same time because any two terminal pairs 
in the circulator make up an isolator. 

Unfortunately, none of the above-mentioned switches are extremely 
fast in their operation. They all involve collapsing a rather sizable mag- 
netic field and reproducing it in the opposite sense. This requires a par- 
ticular amount of energy and, therefore, either a lot of time or a lot of 
power. There is a faster, more elegant means of switching, but it presents 
certain difficulties. 

This second means permits the magnetic field to remain constant, but 
requires the resistance values in the parallel networks to change. The 
network values must be negative for at least one of the conditions and 
could be either positive or negative for the other. This might be achieved 
by using a fixed negative resistance in series with a positive resistance 
whose value could be shifted or switched from one desired value to 
another desired value. For example, one might use three resistors, one 
negative and two positive, and have a solid-state switch across one of the 
positive resistances. By shorting or opening the switch, the Hall effect 
switch could be operated. 

Thus, an isolator switch could be reversed by opening or closing two 
electronic switches. A circulator switch could be operated by opening or 
closing three electronic switches. One calculation on a particular
circulator indicates that, by changing the resistance values in C, from
-304 ohms to -168 ohms, the circulator could be switched. In this instance
the forward loss in one condition is 4.5 db and in the other condition
there is a forward gain of 3 db.

The reverse loss in each condition should still be of the order of 50 to 
60 db. The only real difficulty is in obtaining the negative resistances. 
It should be noted that amplifier connections could be reversed with 
electronic switches, and the Hall effect devices would not be needed. 
Only the circulator switch performs a new function. Perhaps it would be 
useful. 


4.2 Transducer 


Hall effect transducers* can be constructed which convert mechanical 
motion into electrical signals. The mechanical motion moves a gyrator
sample in and out of the air gap of a permanent magnet. Such a device
could be useful in measuring strain or other displacements. It has been
estimated (admittedly optimistically) that an InSb transducer could
detect a displacement of about 1 angstrom if the gradient of H were 10
oersteds/micron.
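
The displacement estimate can be restated as the field change the sample
would have to resolve, which is simply the field gradient multiplied by
the displacement. The sketch below does this unit conversion; the
function name is hypothetical.

```python
ANGSTROMS_PER_MICRON = 1.0e4  # 1 micron = 10,000 angstroms


def field_change_oersted(gradient_oe_per_micron: float,
                         displacement_angstrom: float) -> float:
    """Change in H seen by a sample moved through a uniform field gradient."""
    displacement_micron = displacement_angstrom / ANGSTROMS_PER_MICRON
    return gradient_oe_per_micron * displacement_micron


if __name__ == "__main__":
    # Figures quoted in the text: 10 oersteds/micron and 1 angstrom.
    print("field change, oersteds:", field_change_oersted(10.0, 1.0))
```

For the quoted figures the sample would have to resolve a field change of
about one thousandth of an oersted.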

The same device could be used as a phonograph pickup or as a micro- 
phone. It should be interesting here because it responds to very low fre- 
quency — even direct current. 

A microphone and strain gauge have been constructed, and they be- 
haved more or less as expected. 


4.3 Magnetic Field Meter 


The Hall effect has been used to measure magnetic field strengths for 
about ten years.?*41516 Since the output voltage of a Hall effect 
sample is proportional to the product of the sample input current I and
the applied H, if I is constant the output voltage is proportional to H.
For some years now, there has been a commercial instrument based on 
this principle on the market. An alternating current is applied to the 
sample so that the output is this ac carrier modulated by the magnetic 
field. Thus, either slowly varying direct current or low-frequency ac fields 
can be measured. The probe is about 25 mils thick (and could be made 
thinner) so that the field even in a thin air gap can be measured. Field 
strengths from a few oersteds up to 30,000 oersteds can be readily meas- 
ured with a germanium probe. With care, even smaller fields are measur- 
able. InSb can supposedly detect fields as low as 10-* oersteds, but that 
sensitivity has not yet been achieved. However, an InSb sample can
easily measure the earth’s field.


Fig. 11. Electrical compass.


The Hall effect magnetic field meter is useful in measuring air-gap flux 
densities, flux distributions (because of its small size), demagnetization 
and hysteresis curves, and several other parameters. Its accuracy is 
inversely proportional to the temperature coefficient of carrier mobility 
in the semiconductor, so something like germanium should be of more 
general use than InSb. Nevertheless, InSb definitely gives a greater 
output for the same field, and its use is indicated where small fields or 
small variations are to be determined. It can be shown that

$$\frac{\text{power out}}{\text{power in}} \propto \mu_H^2 H^2.$$

Thus, InSb has an obvious advantage when H is small.
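
Because the power ratio scales with the square of the mobility-field
product, two materials can be compared at the same field by squaring the
ratio of their Hall mobilities. The mobility values in the sketch below
are assumed, order-of-magnitude figures chosen for illustration and are
not taken from this article; geometry-dependent constants are ignored.

```python
def relative_power_ratio(mu_h: float, h_oersted: float) -> float:
    """Quantity proportional to (power out)/(power in): (mu_H * H * 1e-8)**2."""
    return (mu_h * h_oersted * 1e-8) ** 2


def material_advantage(mu_a: float, mu_b: float) -> float:
    """Factor by which material A outperforms material B at the same field."""
    return (mu_a / mu_b) ** 2


if __name__ == "__main__":
    mu_insb = 70000.0  # cm^2/volt-second, assumed illustrative value
    mu_ge = 3900.0     # cm^2/volt-second, assumed illustrative value
    print("InSb advantage over Ge:", material_advantage(mu_insb, mu_ge))
```

With these assumed mobilities the advantage is a factor of a few hundred,
which is why a high-mobility material is indicated for weak fields.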


4.4 Electrical Compass 


A Hall effect electrical compass* can be constructed as shown in Fig. 
11. The rods are made of high permeability material and serve to
concentrate the earth’s field on the sample, thus increasing the
sensitivity. Since the output is proportional to $I \times H$, the reading
will be maximum (say positive) when the rods are in line with the earth’s
field in one direction, and maximum negative when the rods are in line in
the opposite direction. When the rods are perpendicular to H, there will
be a null. Using an InSb sample, with no amplification, a rotation of 1°
from the null position is detectable. With amplification, the sensitivity
could be increased.
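
The behavior described above follows if the sample responds only to the
component of the earth’s field along the rods. The sketch below assumes
an idealized cosine response (an assumption made here for illustration)
and shows how small the output is one degree away from the null. The
function name is hypothetical.

```python
import math


def relative_output(angle_from_field_deg: float) -> float:
    """Output relative to its maximum for rods at a given angle to the field.

    Assumes an ideal response proportional to the field component along
    the rods, i.e. the cosine of the angle between rods and field.
    """
    return math.cos(math.radians(angle_from_field_deg))


if __name__ == "__main__":
    for angle in (0.0, 89.0, 90.0, 180.0):
        print(angle, "deg ->", relative_output(angle))
```

One degree from the null the output is under 2 per cent of the maximum
reading, which gives a sense of the resolution implied by a detectable 1°
rotation.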

In this device, a change in temperature would only affect its sensitivity 
slightly if it were used at a maximum or minimum position. 


4.5 Magnetic Field Variation Meter 


Fig. 12. Magnetic field variation meter.


If two Hall effect samples are connected in series as shown in Fig. 12,
and if they are identical samples, there will be no output voltage except
when the average field in sample 1 is different from the average field in
sample 2." If the field is uniform, the two outputs will cancel. Such a 
device is useful in determining magnetic field gradients, and has been 
used to detect cracks in metals. To do this, a permanent magnet pro- 
duces a field in the metal, and where there are surface cracks (perhaps too 
small to be seen with the naked eye), there will be a stray magnetic field 
at the metal surface. Such a localized field can be readily detected with 
this type of device. 

If a single Hall effect sample were used, a positive and then a negative 
pulse would appear in the output for each crack. With the two samples in 
series and properly spaced, the two pulses can be additive. 


4.6 Ammeter


A Hall effect ammeter* can be constructed in at least a couple of ways. 
If one uses an ordinary gyrator (with a permanent magnet field) and 
connects the input leads in series with a current-carrying line, the gy- 
rator’s output voltage will be proportional to the line’s current. If the 
current is too high for the gyrator sample, the gyrator input can be 
shunted with a calibrated resistor. One advantage of this is that it per- 
mits one to measure very high frequency currents. This hardly seems 
preferable to inserting a simple resistor in the line and measuring the 
voltage developed across it, but this Hall effect ammeter is mentioned 
because it will be referred to in the next section. 

Another and perhaps more useful form of ammeter is shown in Fig. 13. 
A yoke of ferromagnetic material is hinged at the bottom so that it can 
be clipped around a current-carrying conductor. A Hall effect sample is 
in the magnetic path, with a direct current applied to two of its ter- 
minals. If there is either a direct or alternating current flowing in the 
conductor, there will be a corresponding output voltage proportional to 
the current, since the magnetic field is proportional to the conductor 
current. This device should be useful when it is undesirable or impossible 
to break a line to insert an ordinary ammeter. It should be particularly 
useful for direct currents, because there is no similar instrument which 
measures direct currents. 

Fig. 13 — Ammeter (figure label: current-carrying conductor). 


4.7 Wattmeter 


A Hall effect wattmeter®'’'? is easily obtained by using the first 
ammeter described above and employing the voltage between the two 
conductors to energize a coil wound on the magnetic core. Thus, the 
sample’s input current is proportional to the circuit’s voltage, and the 
output voltage is proportional to the power transmitted through the 
conductors. This is a simple wattmeter, but it is not useful at high fre- 
quencies. 

A wattmeter can be used on a coaxial line by connecting the Hall 
effect sample from the center conductor to the shell so that the plane of 
the sample includes the axis of the line (see Fig. 14). Thus, the magnetic 
field is supplied by current in the central conductor, and the output 
voltage is proportional to the power. 

Due to its high-frequency capabilities, the Hall effect has been proposed 
as a means of measuring power in waveguide circuits. In this case, 
the E field provides the current to the sample and the H field supplies 
the magnetic field. The output leads are brought out on equipotentials 
through holes in the sides of the waveguide. It is theoretically possible 
to meter power in this manner to quite high frequencies. 

Fig. 14 — Coaxial line wattmeter. 


4.8 Amplifier 


If a direct current is supplied to the input of a Hall effect sample which 
is in the gap of an electromagnet, the setup can be an amplifier.*?° If 
the magnet is efficient enough, and if the sample’s input current is large 
enough, the output power from the sample can be greater than the power 
applied to the magnet. With InSb, a gain of 5 has been achieved. Greater 
gains are probably attainable. 

Such an amplifier could be used for direct or alternating current. The 
upper frequency limitation would be determined by the quality of the 
magnetic core and the stray capacitance of the core winding. This type 
of amplifier seems to offer little or nothing that a more conventional 
amplifier would not offer, but it is interesting in that it requires a low- 
voltage, high-current power supply — perhaps the type of thing thermo- 
electric generators could furnish. 


4.9 Modulator 


The Hall effect can be used to make a product modulator.? If one 
signal is applied to the input leads of a Hall effect sample and another 
signal is applied to a winding on a magnetic core to produce the mag- 
netic field, then the output voltage is proportional to the product of 
the two signals. This type of modulator has been proposed as a substitute 
for the mechanical chopper found in most sensitive dc amplifiers. 
The output voltage of a straight dc amplifier drifts with time and 
temperature because of instability of the components. Mechanical choppers 
are used to convert the dc input to an ac signal, which in turn can be 
amplified with much less drift. The Hall effect modulator has an inherent 
drift-free zero set and can convert dc to ac with no mechanical motion. 
Thus, it should be longer lived than relay-type choppers. 

Furthermore, mechanical choppers usually operate at 60 cps, whereas 
the Hall effect modulator frequency may easily be 1000 cps. This permits 
a broader band dc amplifier to be realized. Direct current signals 
as low as 20 microvolts (across 400 ohms input impedance) have been 
successfully amplified, and a dynamic range of 70 db has been observed. 
If additional effort were applied to reducing pickup in the output circuit, 
the 20-microvolt minimum could presumably be further reduced. 
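
As a rough worked figure, the quoted numbers can be turned into an input power and a largest usable input. The sketch below assumes, which the text does not state, that the 70 db dynamic range is a voltage ratio measured upward from the 20-microvolt minimum; the variable names are illustrative only.

```python
# Figures quoted in the text.
MIN_SIGNAL_VOLTS = 20e-6       # 20 microvolts
INPUT_IMPEDANCE_OHMS = 400.0   # input impedance
DYNAMIC_RANGE_DB = 70.0        # observed dynamic range


def db_to_voltage_ratio(db: float) -> float:
    """Convert decibels to a voltage (amplitude) ratio: 10**(db/20)."""
    return 10.0 ** (db / 20.0)


# Power delivered to the input impedance at the minimum signal, P = V^2 / R.
min_power_watts = MIN_SIGNAL_VOLTS ** 2 / INPUT_IMPEDANCE_OHMS

# ASSUMPTION: the dynamic range is referred to the minimum signal and is
# a voltage ratio; the source does not say how it was defined.
max_signal_volts = MIN_SIGNAL_VOLTS * db_to_voltage_ratio(DYNAMIC_RANGE_DB)

print(f"input power at the minimum signal: {min_power_watts:.1e} W")
print(f"largest input under the assumption: {max_signal_volts * 1e3:.1f} mV")
```

Under these assumptions the minimum signal corresponds to about one micromicrowatt at the input, and the upper end of the range to roughly 63 millivolts.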
4.10 Demodulator 


A Hall effect square-law demodulator or detector is precisely the 
same as a device referred to as a Hall effect full-wave rectifier elsewhere.’ 
If an incoming signal represented by $I = I_0 \cos \omega t$ is applied to both 
the Hall effect sample input leads and also to the magnet winding, the 
output voltage will be 

$$V = \frac{k_1 I_0^2}{2} + \frac{k_1 I_0^2}{2} \cos 2\omega t, \tag{4}$$

where $k_1$ is a constant involving the Hall constant $R_H$ and the various 
parameters of the magnetic circuit. The first term in (4) is a dc term 
proportional to the square of the amplitude of the input signal. The second 
is a double-frequency component which can be easily filtered out. 
Such a detector should give an accurate square conversion for a wide 
range of amplitudes. 
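
The form of (4) can be checked in one step. The sketch below assumes that the Hall output is $k_1$ times the product of the sample current and the magnet-winding current (that is, that the field is proportional to the winding current); it is offered as a consistency check rather than as the derivation used in the references.

```latex
\begin{align*}
V &= k_1 \,(I_0\cos\omega t)(I_0\cos\omega t) = k_1 I_0^2 \cos^2\omega t \\
  &= k_1 I_0^2 \cdot \tfrac{1}{2}\,(1+\cos 2\omega t)
   = \frac{k_1 I_0^2}{2} + \frac{k_1 I_0^2}{2}\cos 2\omega t .
\end{align*}
```

The dc term and the double-frequency term therefore have equal amplitude, so the filter that follows the detector must reject a component as large as the one it keeps.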


A linear Hall effect detector²¹ may be constructed on the same principle 
if a sine wave of constant amplitude (and the same frequency $\omega$) 
is applied to the magnet winding. Then (4) becomes 

$$V = \frac{k_2 I_0}{2} + \frac{k_2 I_0}{2} \cos 2\omega t. \tag{5}$$

The first term in (5) is proportional to the first power of the signal 
amplitude, and so this device should give a dc output voltage proportional 
to the input signal amplitude. Again, the conversion ratio should be 
linear for a wide range of amplitudes. 

One difficulty with both these demodulators is that they are operable 
only up to frequencies at which magnetic fields of the order of 1000 oer- 
steds or greater can be sinusoidally reversed with the power that happens 
to be available. 


4.11 Frequency Spectrum Analyzer 


A highly selective frequency spectrum analyzer* can be made from 
the linear Hall effect demodulator. If the signal applied to the sample 
is $I_s \cos \omega_s t$ and the signal applied to the magnet winding is of 
frequency $\omega_H$, the usual sum and difference frequency components 
appear in the output: 

$$V = k_2 \left[ \frac{I_s}{2} \cos (\omega_s - \omega_H)t + \frac{I_s}{2} \cos (\omega_s + \omega_H)t \right]. \tag{6}$$

It may be noted that (6) reduces to (5) when $\omega_s = \omega_H = \omega$. 

Suppose the signal applied to the sample is composed of many frequency 
components and that the amplitude of the magnetic field is 
held constant as its frequency is changed. If the output is applied to a 
dc voltmeter, there will be a reading only when $\omega_H$ coincides with some 
$\omega_s$, say $\omega_{si}$. Actually, the dc output is given by 

$$V_{dc} = \frac{k_3 I_{si}}{2} \cos (\Phi_H - \Phi_{si}), \tag{7}$$

where $k_3$ is a constant and $(\Phi_H - \Phi_{si})$ is the difference between the 
phases of the magnetic field and the ith component of the input. Thus, unless 
the phase difference is known, it is impossible to determine $I_{si}$. 
Therefore, it is recommended that $\omega_H$ be adjusted to within about 1 cps of 
$\omega_{si}$ so that the “dc” output is a 1-cps cosine wave whose argument 
$(\Phi_H - \Phi_{si})$ either increases or decreases $2\pi$ radians per second. If this 
output is applied to a zero-centered dc voltmeter with a response time 
$\tau < 0.25$ second, the meter needle follows the voltage variation, and 
the amplitude of the ith component is proportional to the maximum 
swing of the needle. 

Suppose the frequency of the next component is $\omega_{s2}$. If $(\omega_H - \omega_{s2})$ is 
large enough so that 

$$\frac{2\pi}{\omega_H - \omega_{s2}} < \frac{\tau}{10},$$

then the meter will not respond to this component’s contribution to the 
output. Thus, by slowly sweeping $\omega_H$, a frequency spectrum analysis of 
any input wave can be obtained. 
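
The selectivity described above can be illustrated numerically. The following is a minimal sketch, not a model of any actual device: the product of the input and a unit-amplitude reference stands in for the Hall output, a plain average over one second stands in for the dc voltmeter, and the function name, amplitudes, frequencies and phases are all made up for the illustration. Integer frequencies are used so that the unwanted sum and difference terms average to zero over the window.

```python
import math


def averaged_output(components, f_ref, phase_ref=0.0, k=1.0,
                    duration=1.0, sample_rate=20000):
    """Average of k * signal(t) * reference(t) over `duration` seconds.

    components: list of (amplitude, frequency_hz, phase_rad) tuples.
    The reference is a unit-amplitude cosine at f_ref standing in for the
    magnetic field; the average stands in for the dc voltmeter reading.
    """
    n = int(duration * sample_rate)
    total = 0.0
    for i in range(n):
        t = i / sample_rate
        signal = sum(a * math.cos(2 * math.pi * f * t + p)
                     for a, f, p in components)
        reference = math.cos(2 * math.pi * f_ref * t + phase_ref)
        total += k * signal * reference
    return total / n


# Hypothetical two-component input: (amplitude, frequency in cps, phase).
components = [(1.0, 100.0, 0.0), (0.5, 300.0, 0.3)]

for f_ref in (100.0, 200.0, 300.0):
    print(f_ref, round(averaged_output(components, f_ref), 4))
```

With the reference at 100 or 300 cps the average comes out as half the component amplitude times the cosine of the phase difference, as in (7); at 200 cps, where the input has no component, it is essentially zero. The dependence on phase is the reason the text recommends offsetting the reference by about 1 cps and reading the peak swing.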

This device responds with equal accuracy to both small- and large- 
amplitude components. Its upper frequency is limited, because a sinus- 
oidal magnetic field must be obtained at that frequency. Fortunately, 
the signal does not have to supply this field; a high-power oscillator may 
be used. Since the available output level is determined by the component 
amplitude and the value of magnetic field, the upper frequency will be 
determined by the required sensitivity. Perhaps its upper limit in a 
typical use would lie between 0.5 and 5 mc. 

Of course, it operates very well down to zero frequency, and this is 
probably where it will find its greatest use — at audio frequencies and 
even lower. Conventional analyzers require huge tuned filters at these 
frequencies to obtain the Q required for high selectivity. With the Hall 
effect analyzer, all that one requires is a single low-pass filter. Notice 
that the amplitude of the output is independent of the frequency difference 
$(\omega_H - \omega_{si})$. Thus, a single low-pass filter of adjustable upper 
frequency gives one a very simple means of adjusting selectivity without 
affecting the sensitivity. 

With a crude device like this, it was quite easy to measure the first 
ten harmonics of a square wave of 25 cps fundamental frequency. With 
care, 18 harmonics were detected. This device does not represent the 
ultimate in accuracy, but it may do so in small size, simplicity and ease 
of operation. 


4.12 Phase Discriminator 


The Hall effect phase discriminator simply makes use of the phase 
sensitivity of the spectrum analyzer. It is sometimes necessary to ob- 
tain an electrical signal which is a measure of the difference between the 
phases of two signals of the same frequency. If these two signals are 
applied to the spectrum analyzer, the output is a dc component plus a 
double-frequency component [see (5)]. The dc term is given more completely 
by (7). If the two amplitudes are constant (not necessarily equal), 
the dc output can be calibrated to give the phase difference directly. 
As with the analyzer described above, the frequency limit will be determined 
largely by the required sensitivity. Perhaps this limit will be 
somewhere between 1 and 10 mc. This is estimated to be higher than 
that for the analyzer, on the assumption that the signal level can be 
higher in the discriminator (since it needs to operate at only one level). 

Incidentally, this phase sensitivity must be considered if one constructs 
the wattmeter or demodulator previously referred to. 
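
The calibration mentioned above might be sketched as follows, assuming the dc output follows the cosine law of (7) and that the output with the two signals in phase has been measured once for the fixed amplitudes in use. The function and parameter names are hypothetical.

```python
import math


def phase_difference_deg(v_dc: float, v_peak: float) -> float:
    """Phase difference in degrees (0 to 180) from the dc output.

    v_peak is the dc output observed when the two signals are in phase,
    i.e. the calibration constant for the fixed amplitudes in use.
    """
    if v_peak <= 0:
        raise ValueError("v_peak must be positive")
    # Clamp so that small measurement overshoot does not break acos.
    ratio = max(-1.0, min(1.0, v_dc / v_peak))
    return math.degrees(math.acos(ratio))


print(phase_difference_deg(0.5, 1.0))
```

Because the cosine is an even function, the dc term alone gives the magnitude of the phase difference but not its sign; resolving the sign would need additional information not discussed here.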


4.13 Digital-to-Analog Encoder 


The Hall effect digital-to-analog encoder is obtained by applying the 
maximum allowable dc voltage to the sample input leads and using n 
separate windings (to encode n binary digits) on the magnetic core (see 
Fig. 15). Possibly one extra winding would be used for zeroing purposes. 
Current limiters assure that the current input levels are fixed, all at the 
same level; the digits are weighted by different numbers of turns in the 
windings. If the current limiters do not pass precisely the desired current, 
the situation might be improved by adjustment of turns. All the input 
circuits may be completely isolated in a dc sense. A disadvantage is 
that the minimum time in which the magnetic field can be changed is 
limited to something on the order of a millisecond, so that only about 
1000 binary numbers per second may be applied. Since this is the same 
setup as the Hall effect amplifier, it is conceivable that the encoder might 
introduce gain. 

Fig. 15 — Digital-to-analog encoder. 
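
The weighting by turns can be sketched in a few lines. The example assumes binary weighting (turns in the ratio 1:2:4:...), equal currents in all energized windings, and a core operated in its linear region so that the field, and hence the output, is proportional to the net ampere-turns. None of these specifics is given in the text; the function names are illustrative.

```python
def winding_turns(n_bits: int, base_turns: int = 1) -> list[int]:
    """Turns for each winding, least significant digit first.

    ASSUMPTION: binary weighting, each winding having twice the turns
    of the one before it.
    """
    return [base_turns * 2 ** i for i in range(n_bits)]


def encoder_output(bits, turns, k=1.0):
    """Relative output: k times the net turns carrying the fixed current.

    bits: sequence of 0/1 values, least significant digit first.
    """
    if len(bits) != len(turns):
        raise ValueError("one digit per winding is required")
    return k * sum(b * n for b, n in zip(bits, turns))


turns = winding_turns(4)                      # four-digit encoder
print(turns, encoder_output([1, 0, 1, 1], turns))
```

With four windings of 1, 2, 4 and 8 turns, the digits 1, 0, 1, 1 (least significant first) give a relative output of 13. Trimming individual entries of the turns list corresponds to the adjustment of turns suggested for imperfect current limiters.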


4.14 Analog Multiplier 


The Hall effect analog multiplier* *° is essentially a Hall effect demodu- 
lator. A direct current is driven through the sample input leads and an- 
other direct current is applied to a winding on the core. The output volt- 
age is proportional to the product of these two signals. Thus the device 
is an analog multiplier. Ref. 22 concluded that silicon offers the best 
combination of properties for this device. In order to realize a reasonable 
accuracy in the output (about 0.3 per cent) it was necessary to have a response 
time of approximately 1 millisecond. The error was reduced slightly (to 
about 0.1 per cent) when the input repetition rate was decreased to 20 
cps. Ref. 22 also presents a scheme for minimizing temperature effects. 

V. MATERIALS 

Strange as it may seem, all of the above devices place approximately 
the same requirements on the semiconductor material used. Generally 
speaking, the ideal material would have a high electron Hall mobility 
and a large energy gap, and both these parameters would be constant. 

If a material with these properties were available, minimum losses 
would be realizable with rather small magnetic field strength — say, 1000 
oersteds. Furthermore, sample resistances would be fixed by their geom- 
etry and doping level. Of course, other requirements could be laid down, 
but they are certainly of less importance than the two given above. 

Now, these two requirements are mutually incompatible. That is, a 
material with a high mobility $\mu_H$ is almost certain to have a small energy 
gap, $E_g$, and, vice versa, a material with a large $E_g$ will have a small $\mu_H$. 
If this relationship did not exist, many Hall effect devices would find 
much wider application than they now enjoy. 

Intermetallic semiconductors offer the best hope of a partial solution 
to this problem. They show a wide range of properties but still follow 
the trend of combining small $E_g$ with high $\mu_H$. One material, however, 
gallium arsenide, has a mobility of 6000 cm²/volt-second (higher than 
that of germanium) and an energy gap of 1.35 electron volts (higher 
than that of silicon). Let us hope that other materials will be found which 
will violate the $(\mu_H - E_g)$ trend even more definitely and that they will 
be relatively easy to fabricate in highly pure single-crystal form. Refs. 
23, 24 and 25 describe a large number of semiconductors (both elemental 
and intermetallic) and their properties and refer to the apparent 
relationship between $\mu_H$ and $E_g$. Ref. 26 gives experimental information on 
mercury selenide, a II-VI compound. 

Before leaving materials it is worth mentioning that Ref. 5 develops 
an elaborate mathematical solution to the problem of finding the electric 
field distribution in a Hall effect sample. Several different shapes are 
treated, including the square gyrator, the skew isolator and the circu- 
lator. Ref. 27 suggests a method of measuring resistivity and Hall effect 
in flat samples of any arbitrary shape. Ref. 28 gives an analog method 
for obtaining the same two parameters; this last method involves no 
analytical computation. 


VI. CONCLUSIONS 


The Hall effect has thus far furnished a tremendous variety of devices. 
All of these devices have at least one important advantage over most 
semiconductor devices: Hall effect devices require the use of majority 
current carriers only and therefore involve no junctions. There are no 
areas of concentrated electric fields, so the surfaces need not be 
elaborately protected. Most or perhaps even all of the devices can be operated 
indefinitely in ordinary indoor atmospheres with no protection whatever. 

Because these are majority carrier devices, the lifetimes of minority 
carriers are of no consequence. When an electric field is induced in a 
semiconductor, a brief time elapses while the current carriers redistribute 
themselves. This relaxation time (ordinarily less than a millimicrosecond) 
is the only factor which limits the frequency of signals which 
can be satisfactorily transmitted through a Hall effect sample. Interlead 
capacitance will tend to shunt part of the signal around the sample and 
will thus limit the frequency range of operation more severely. Of course, 
where a signal supplies the magnetic field, the frequency limitations will 
depend on the core used, the winding, the signal level and the sensitivity 
and linearity required. 

The Hall effect is a small effect. As a result, the devices it makes possible 
are inefficient or lossy. In order for Hall effect devices to be stable, 
the majority carrier mobility must be held constant, as must the magnetic 
field (or core permeability as the case may be). In order that these devices 
have linear characteristics, all current-carrying contacts must be ohmic. 

Despite the disadvantages outlined in the preceding paragraph, Hall 
effect devices are being studied with steadily increasing interest. 


An Appraisal of Received Telephone 
Speech Volume 


By O. H. COOLIDGE and G. C. REITER 


(Manuscript received November 3, 1958) 


One of the attributes of telephone service which is of importance to a 
telephone user is the loudness with which he hears the voice of a distant talker. 
Related to this loudness is an objective measurement of “received volume.” 

This paper presents the results of subjective tests made to determine a 
relationship between received volume and the satisfaction of telephone listeners. 
The results are shown as statistical distributions of listeners’ opinion, which 
may be combined with estimated distributions of received volume in the 
telephone plant to give “grade of service.” 

Grade of service objectives have been stated as 95 per cent of connections 
rated “Good”, 5 per cent “Fair” and a negligible percentage “Poor.” 
Increased use of the more efficient 500-type telephone sets, and planned 
improvements in the circuits which interconnect them, will make it possible to meet 
these objectives. 


I. INTRODUCTION 


Outside of his monthly telephone bills, an occasional visit to the Tel- 
ephone Company commercial office and now and then a brief telephone 
conversation with the telephone operator, the customer’s only contact 
with the telephone system is his telephone set. He knows that the wires 
and cables he sees strung on poles and the telephone building he 
occasionally passes on his way downtown bear some relation to the telephone in 
his home or office, but this is not very important to him. The important 
thing is his telephone set. Through it he can communicate with anyone 
else who has access to a telephone, no matter how far away that person 
happens to be. 

Now what does the user expect from his telephone? We might say 
that he wants it to be reasonably pleasing in appearance, comfortable to 
use and simple to operate. He expects accuracy; that is, he wants to be 
connected to the party he calls and not to some stranger, and he does 
not want to be annoyed by answering “wrong numbers”. Our typical 
customer is impatient when his call is unduly delayed without an 
explanation as to the reason for the delay. He wants a telephone bell or other 
signal which can be heard in all parts of his home but which is not 
objectionably loud if he happens to be near the telephone when it rings. He 
will object to “clicks” or other loud or unpleasant noises from his 
telephone. He is annoyed if he can hear “crosstalk” which is intelligible or 
nearly intelligible, because it implies violation of the privacy he expects 
in his own telephone conversation. He wants to hear in his telephone 
receiver a reasonably faithful, undistorted reproduction of the voice of the 
speaker. Finally, the most important attribute a customer expects of his 
telephone set (important because, without it, he might find no reason 
to have a telephone at all) is that it permit him to hear and to be heard 
with a minimum of effort on his part. 

THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1959 

It is with this last attribute that the present paper is concerned. 
Briefly, if a customer hears the voice of his partner in telephone conversa- 
tion with sufficient loudness and freedom from distortion and noise, lit- 
tle effort will be required to hear and understand what is said. If the dis- 
tant voice is weak or distorted, the strain of listening requires effort. 
Likewise, if the distant party hears only with effort he, in turn, may re- 
quest our typical customer to talk more loudly, which again calls for ef- 
fort. In telephone systems of modern design, distortion and noise are no 
longer troublesome factors, so we shall consider only the loudness aspect. 
Thus, this paper is concerned with the determination of the magnitudes 
of received volume, or loudness, which give varying degrees of satisfac- 
tion to users, and with obtaining an estimate of how well the grade of 
transmission service provided by the Bell System meets the objective of 
satisfying customers in this respect. 

What the customer would like to have and what it is economically 
feasible to give him may not be entirely compatible. Giving him what he 
wants has not always been feasible in the past, but, with the more gen- 
eral application of the improved telephone facilities that have become 
available over the past decade, this now appears within the realm of pos- 
sibility. 

What do we mean by “what he wants”? We must bear in mind that 
people vary widely in their judgment of preferred loudness, just as they 
do in their judgments involving their senses of feeling, sight, taste or 
smell. This is therefore a statistical problem involving the likes and dis- 
likes of a large number of people. Their combined judgment might be 
expected to approximate (except at the tails) some distribution related 
to the normal law. A small number of telephone users will be quite tolerant 
of low received volumes, and another small number at the opposite 
end of the distribution will desire very loud volumes. But the large bulk 
of the telephone users will prefer some loudness level about midway be- 
tween these extremes. It is the job of the engineer to find out by experi- 
ment that area of received volumes wherein satisfaction for the greatest 
number lies. 

To carry out such an experiment with the telephone-using public would 
be a prodigious and impractical undertaking. However, by sampling 
methods and by closely controlled experimentation one can obtain an 
answer in the laboratory. The first thing to be done is to decide on a 
yardstick for expressing degree of satisfaction. Testing procedures are 
then set up whereby people can listen to different grades of transmission 
and then express their opinions in terms of the selected yardstick. 
Measurements of this type are known as “subjective” measurements. The 
results obtained depend wholly on the judgments of those taking the 
test. For this reason, large numbers of participants are required to obtain 
reliable answers. 

Laboratory tests of this nature were conducted at Bell Telephone Lab- 
oratories over different portions of the total range of received telephone 
speech volumes at various times from 1947 to 1954. The results of all 
these tests are combined and summarized here. 


II. RESULTS 


Two types of subjective transmission tests commonly made in the 
laboratory are here called “appraisal tests” and “comparison tests”. 
During the appraisal tests observers listen to speech over a telephone 
connection with specified transmission parameters such as volume, noise 
interference, etc. The observers are then asked to give their opinions of 
the transmission qualities of the connection by assigning it to one of 
several specified categories such as “Good”, “Fair”, “Poor”, or the like. 
One of the transmission parameters is then changed and the process re- 
peated. Each condition is thus rated on its own merits without direct 
comparison to any reference condition. In comparison tests, on the other 
hand, each transmission condition is compared immediately with some 
other condition, usually a known reference condition, and observers are 
asked which of the two they prefer in each case. The two types of sub- 
jective tests are useful under different circumstances that need not be 
pursued further here. The point to be noted is that results shown in this 
paper, except in one case, are based on appraisal tests. 

Fig. 1 — Opinion distributions: percentage of observations in which various 
values of received volume are assigned to the indicated basic categories. 

Distributions of appraisal categories in a very wide range of received 
telephone speech volumes are shown in Fig. 1. The range shown is 
considerably greater than any to be expected in commercial telephone service, 
since it extends from received volumes which are unbearably loud to 
those which are practically inaudible. Within this range there are seven 
natural categories descriptive of the transmission performance of a 
telephone connection at different levels of received volume in units known 
as VU. They are here called basic categories, and are as follows: 

Intolerably Loud, 

Unpleasantly Loud, 

Good, 

Fair, 

Poor, 

Unsatisfactory, 

Could Not Understand. 
The distribution curves for these categories must not be confused with 
probability density curves, under which the total area must always equal 
unity. Instead, each one shows the percentage of total observations at 
each volume level in which that level is assigned to the indicated cate- 
gory. Since any volume level must be assigned to some category, the sum 
of the percentages of the various categories at any one level must be 100. 
For example, for a received volume of −10 VU at the line terminals of 
a 302-type telephone set, 33 per cent of the listeners would say the 
volume is “Unpleasantly Loud” and 67 per cent would say that it is “Good”. 
For a received volume of −25 VU, everybody would rate the call as 
“Good”. For a received volume of −40 VU, 43 per cent would vote 
“Good”, 50 per cent “Fair” and 7 per cent “Poor”. 

Fig. 2 — Opinion distributions: percentage of observations in which various 
values of received volume are assigned to the indicated categories. Solid curves 
are cumulative categories; dashed curves are basic categories. 


The various appraisal categories in Fig. 1 are centered about the 
“Good” category. Curves to the left of “Good” indicate categories which 
are progressively less acceptable because of undesirably higher received 
volumes. The categories to the right are progressively less acceptable 
because of lower and lower received volumes. The two end categories, 
“Intolerably Loud” and “Could Not Understand”, act as barriers 
because there is no further category on one side of each of them. Thus, each 
must eventually reach 100 per cent at sufficiently high (or low) volume 
levels. The distribution of the “Good” category also reaches a maximum 
of 100 per cent, not because it is a barrier, but because it covers such a 
wide range of received volumes that there is practically unanimous 
opinion in the middle of that range, which is around −23 or −24 VU. 

The data shown in Fig. 1 can also be plotted in cumulative categories 
as shown by the solid curves in Fig. 2. The three dashed curves, 
“Intolerably Loud”, “Good” and “Could Not Understand”, are individual 
(basic) rather than cumulative categories, copied from Fig. 1 for 
purposes of orientation. The cumulative categories at the high-volume end 
include all individual categories to their left, while the cumulative 
categories at the low-volume end include all individual categories to their 
right, as may be seen by comparing Figs. 1 and 2. Thus, the percentage 
value at any volume level for “Too Loud” on Fig. 2 is the sum of the 
percentage values for “Unpleasantly Loud” and “Intolerably Loud” on 
Fig. 1. For example, at −5 VU, 67 per cent of the listeners vote 
“Unpleasantly Loud” and 8 per cent vote “Intolerably Loud”. The 
percentage in the cumulative category “Too Loud” is therefore 75, as shown 
in Fig. 2. Likewise, at −45 VU 56 per cent vote “Fair”, 30 per cent 
“Poor” and 3 per cent “Unsatisfactory”. Thus, on Fig. 2 “Fair or 
Worse” is 89 per cent and “Poor or Worse” is 33 per cent. It will be 
noticed that each of the cumulative categories in turn takes over the 
functions of “Intolerably Loud” or “Could Not Understand” in acting 
as an ultimate barrier, and thus eventually reaches a value of 100 per 
cent. 
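
The arithmetic that turns basic categories into cumulative ones can be sketched as follows, using the two worked examples quoted above. The function name is illustrative, and the remainder at each level (25 per cent at −5 VU, 11 per cent at −45 VU) is assumed to fall in “Good”, which the text implies through the requirement that the categories total 100 but does not state outright.

```python
# Basic categories ordered from loudest to faintest, as in Fig. 1.
BASIC = ["Intolerably Loud", "Unpleasantly Loud", "Good", "Fair",
         "Poor", "Unsatisfactory", "Could Not Understand"]


def cumulative(percentages: dict) -> dict:
    """Cumulative categories from basic-category percentages at one level."""
    p = [percentages.get(name, 0) for name in BASIC]
    if sum(p) != 100:
        raise ValueError("basic categories must total 100 per cent")
    return {
        "Too Loud": p[0] + p[1],     # everything to the left of "Good"
        "Fair or Worse": sum(p[3:]),  # "Fair" and everything to its right
        "Poor or Worse": sum(p[4:]),  # "Poor" and everything to its right
    }


# Values quoted in the text; the "Good" entries are inferred remainders.
at_minus_5_vu = {"Unpleasantly Loud": 67, "Intolerably Loud": 8, "Good": 25}
at_minus_45_vu = {"Good": 11, "Fair": 56, "Poor": 30, "Unsatisfactory": 3}

print(cumulative(at_minus_5_vu))
print(cumulative(at_minus_45_vu))
```

The first call reproduces the 75 per cent “Too Loud” figure, and the second the 89 per cent “Fair or Worse” and 33 per cent “Poor or Worse” figures.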

It would be too much to expect reliable results from tests at one sit- 
ting, if observers were required to carry in their heads the large number 
of basic categories shown by the curves of Figs. 1 and 2. Consequently, 
the actual tests were made over reduced ranges of received volume, one 
series in the lower range embracing the five categories from “Good” to 
“Could Not Understand”, inclusive, and another series in the upper 
range including the three categories from “Good” through “Intolerably 
Loud”. 

One is struck also by the tolerance implied by the great width of the 
“Good” distribution. In addition to being a fortunate fact, it suggests 
that there must be some smaller range of volumes included within the 
“Good” category which would be found to be most acceptable of all. 
This preferred range, or “Excellent” category as it has sometimes been 
called, has been determined by two methods and is shown in Fig. 3. It 
was originally determined by appraisal tests similar to those employed 
in determining the results shown in Figs. 1 and 2. Some years later, the 
earlier determination was checked by a different technique employing 
comparison tests. In the comparison tests ten different volume levels 
(covering the “Good” range of Fig. 1) were compared, each with every 
other level. Each volume level was then rated according to the 
percentage of times it was preferred over the other levels with which it was 
compared. 
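
The rating used in the comparison tests can be sketched as a simple tally. With ten levels compared each with every other, there are 45 distinct pairs. The example below uses three invented levels and invented counts purely to show the bookkeeping; the function name and the assumption that every pair was presented the same number of times are not taken from the paper.

```python
from itertools import combinations


def preference_percentages(levels, prefer_counts, trials_per_pair):
    """Percentage of comparisons in which each level was preferred.

    prefer_counts[(a, b)] is the number of times level a was preferred
    over level b. ASSUMPTION: every unordered pair was presented
    trials_per_pair times.
    """
    wins = {level: 0 for level in levels}
    for a, b in combinations(levels, 2):
        a_wins = prefer_counts.get((a, b), 0)
        b_wins = prefer_counts.get((b, a), 0)
        if a_wins + b_wins != trials_per_pair:
            raise ValueError(f"pair ({a}, {b}) does not total {trials_per_pair}")
        wins[a] += a_wins
        wins[b] += b_wins
    comparisons_per_level = trials_per_pair * (len(levels) - 1)
    return {level: 100.0 * w / comparisons_per_level
            for level, w in wins.items()}


# Invented data: three levels in VU, ten presentations of each pair.
levels = [-22, -18, -14]
counts = {(-18, -22): 7, (-22, -18): 3,
          (-18, -14): 6, (-14, -18): 4,
          (-22, -14): 5, (-14, -22): 5}

print(preference_percentages(levels, counts, trials_per_pair=10))
print(len(list(combinations(range(10), 2))))  # pairs among ten levels
```

In this made-up example the −18 VU level is preferred in 65 per cent of its comparisons, against 40 and 45 per cent for the other two, so it would be the modal point of the resulting curve.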

Fig. 3 shows that the two methods give results which check each other 
satisfactorily, since the modal points differ by only 1 db. Using the 
appraisal distribution (to be consistent with the major tests on Figs. 1 and 
2), it appears that a received volume level of −19 VU (the modal point 
of the curve) is the value preferred over all others. It is interesting to 
look back at Fig. 1 and note that −19 VU occurs, not at the center of 
the “Good” distribution, but well toward the high-volume side of its 
table top. This indicates that listeners prefer about the highest volume 
they can get just short of the point where too much loudness becomes 
annoying. 

Fig. 3 — Preferred level of received volume, appraisal test versus comparison 
test. Appraisal test curve shows percentage of observations in which the indicated 
volume level is assigned to the preferred category; modal point is −19 VU. 
Comparison test curve shows percentage of observations in which the indicated level 
is preferred in comparison with other levels tested; modal point is −18 VU. 


III. DESCRIPTION OF TESTS 


It was recognized that the intelligibility of received telephone speech, 
and the listener’s satisfaction, are affected by transmission factors other 
than just speech volume, or loudness. Consequently, wherever the effect 
of other factors might have been significant, they were controlled during 
the tests at values which cause little or no transmission impairment. 
The factors which were controlled and their values are as follows: 

line noise: 17 dba at telephone receiver;* 
room noise: 50 db RAP (reference acoustic pressure); 
speech transmission band: 150 to 3200 cps. 

The test circuit is shown schematically in Fig. 4. It is patterned on a 
typical telephone circuit employing 302-type telephone sets, but with 
means provided for varying the loss of the trunk so that different levels 
of speech volume may be obtained at the receiving end. A volume 
indicator at the transmitting end of the test circuit enables the talker to 
keep the level of his own speech constant, thus providing means for 
determining, from the known losses in between, the level of speech volume 
at the receiving telephone. The transmitter of the receiving telephone 
set is kept operative, so that a normal amount of room noise will be 
present in the receiver through the sidetone path. A multiple listening 
arrangement is provided so that many observers may listen simultaneously, 
each hearing the same speech volume and room noise through the 
sidetone path that are present at the main receiving station. This permits 
gathering a large amount of data with a minimum expenditure of time 
and effort on the part of those conducting the tests. A group of observers 
is shown taking the test at the multiple positions in the photograph in 
Fig. 5. Fig. 6 shows the board at which the test circuit is set up and 
controlled, and the main receiving station to which the observers’ handsets 
are multipled. The talking position does not show in the photographs 
since it is located in an adjacent soundproofed room. 

* Dba is the unit employed in the Bell System for measurements of line noise 
with the 2B Noise Measuring Set. This unit was adopted when the 302-type 
telephone set came into use, in order to provide equal numerical readings of the 2B 
set when noise of equal impairment was encountered in circuits with different 
types of telephone set. 

Fig. 5 — Observers at multiple listening positions during appraisal tests. 

At first, three men and one woman were used as talkers. It was found
that differences in the type of voice (even in the case of the woman talker)
had almost no effect on the opinions of listeners, provided that each talker
impressed the same speech volume on the line. Consequently, in the later
tests almost all of the talking was done by one of the original male talkers.

Fig. 6 — Circuit control board and main receiving station for appraisal tests.

The test itself was a very simple and straightforward procedure. Ob- 
servers were allowed to listen to speech over the test circuit at a particu- 
lar level of received volume. They were then asked to assign that sample 
to one of the categories previously listed for them. In the case of the tests
at the lower levels the categories, as already stated, were “Good”, “Fair”,
“Poor”, “Unsatisfactory” and “Could Not Understand”, while those in
the high-level tests were “Good”, “Unpleasantly Loud” and “Intolerably
Loud”. No attempt was made to define these categories for the observers,
each one deciding for himself what the terms meant. When observers
had recorded their opinions of the first speech sample, the received
volume was changed by adjusting the variable attenuator shown in Fig. 
4, and the process was repeated until the entire range had been covered 
three times. The specific values of received volume used and the ranges 
covered in the different series of appraisal tests are shown in Table I. 


The different volume levels were presented to listeners in random order.
This is necessary because it has been found that sequential orders (high
to low or low to high) result in displacements of the entire opinion
distribution, apparently due to conditioning of observers by volume levels
to which they have become accustomed. No disclosure of the magnitudes
of the volume levels was made until the completion of a test series.
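
The randomized presentation described above can be sketched in a few lines of Python. This is only an illustration: the ten levels listed are hypothetical placeholders, and shuffling each of the three passes separately is an assumption, since the text states only that the order was random and that the range was covered three times.

```python
import random


def presentation_order(levels_vu, passes=3, seed=None):
    """Return a randomized sequence that covers every level once per pass."""
    rng = random.Random(seed)
    order = []
    for _ in range(passes):
        block = list(levels_vu)   # copy so the caller's list is left untouched
        rng.shuffle(block)        # random order within this pass (assumption)
        order.extend(block)
    return order


# Hypothetical ten-level range in VU; not the values used in the tests.
levels = list(range(-30, -60, -3))
order = presentation_order(levels, passes=3, seed=1959)
```

With ten levels and three passes the observer hears 30 conditions, which matches the count on the sample test sheet.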

The process is illustrated by the test sheet shown in Fig. 7, which rep- 
resents the judgments of one individual observer in one of the tests in the 
lower range of volumes. This observer recorded his opinions of the 30 
different conditions to which he listened in the left-hand column under 
“TEST 1”. Later, the analyst recorded the received volumes correspond- 
ing to the 30 conditions and arranged the observer’s opinions in descend- 
ing order of received volumes, as shown in the right-hand portion of the 
sheet. It will be noticed that this particular observer was very consistent 
in his judgments (as most were), and that he was among those fairly 
tolerant of the lowest levels. 

Those who participated in the tests were drawn from personnel of the 
American Telephone and Telegraph Co., Bell Telephone Laboratories 
and several of the operating telephone companies. Naturally, they
represented many facets of the telephone business, but only a small minority
could be rated as experts in transmission matters. One group of more
than 150 men consisted of young engineers just hired by Bell Laboratories
who had practically no telephone experience except as users. The results
obtained from these various groups were analyzed separately to
determine whether different age groups (with corresponding differences
in hearing acuity) or differences in transmission background might be a
factor in their appraisals of received volume. In general, it was found
that there was no significant difference between the opinions of different
groups. For example, the data obtained from the student engineer group
alone differed from the data for the entire group by no more than ±0.5
db.


TABLE I — VOLUME LEVELS USED IN VARIOUS APPRAISAL TESTS

VU at Input of Receiving 302-Type Telephone Set

High-Range Tests    Preferred-Range Tests    Low-Range Tests

+6
+3
0
—3
—6
—9
—12
—15
—18
—21


Fig. 7 — Sample test sheet for appraisal tests.

A sample of the data in cumulative categories is shown in Fig. 8. By 
plotting the experimental points on arithmetic-probability paper one is
able to smooth the data by passing a straight line through the data 
points. Except for some divergence at the tails of the distributions, the 
data are a reasonable approximation to the straight lines, and therefore 
to the normal law of error. The distributions of cumulative categories in 
Fig. 2 are taken directly from the straight lines of Fig. 8 and from similar 
distributions for the categories not shown here. The distributions of basic 
categories in Fig. 1 are obtained by taking vertical differences between
the straight lines of Fig. 8 at each level of received volume. Thus,
“Unpleasantly Loud” is the difference between “Too Loud” and “Intolerably
Loud”, “Fair” is the difference between “Fair or Worse” and “Poor or
Worse”, and so on.

Fig. 8 — Sample of data for opinion distributions — percentage of observations
in which various received volumes are assigned to indicated cumulative categories.
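
The step from cumulative to basic categories can be illustrated with a short Python sketch. Each straight line on probability paper is treated as a normal distribution. The “Fair or Worse” parameters (average —39 VU, standard deviation 4.8 db) are the ones quoted later in this paper; the “Poor or Worse” parameters are hypothetical values chosen only to make the example run.

```python
from statistics import NormalDist

# Each cumulative category is modeled as a normal curve (mean in VU, sd in db).
CUMULATIVE = {
    "Fair or Worse": NormalDist(-39.0, 4.8),  # values quoted in the paper
    "Poor or Worse": NormalDist(-46.0, 4.8),  # hypothetical, for illustration
}


def cumulative_fraction(category, volume_vu):
    """Fraction of observations placing this volume in `category`.

    The fraction falls as the received volume rises, hence 1 - cdf.
    """
    return 1.0 - CUMULATIVE[category].cdf(volume_vu)


def fair_fraction(volume_vu):
    """Basic category "Fair" as the vertical difference of two cumulative curves."""
    return (cumulative_fraction("Fair or Worse", volume_vu)
            - cumulative_fraction("Poor or Worse", volume_vu))


fair_at_minus_42 = fair_fraction(-42.0)
```

The same subtraction applies to any pair of adjacent cumulative categories, provided the two lines do not cross within the range of interest.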

The reliability of the data is attested by the fact that, for the low- 
volume categories, they are based on approximately 1400 observations 
by 470 different listeners at each of the values of received volume indi- 
cated in Table I. The data for the high-volume categories are the result 
of 528 observations by 176 observers at each indicated level. 

These two series have been spoken of here as the “major” tests to
distinguish them from the tests of the preferred range of received volume
levels, the results of which are shown in Fig. 3. The appraisal tests of 
Fig. 3 were made in the same manner as the major tests (at an earlier 
date), but were based on fewer observations, 56 at each of the levels in- 
dicated on Table I. It was decided that results based on so few observa- 
tions might be questionable, and that further tests to confirm the earlier 
results were warranted. Instead of additional appraisal tests, the type of 
comparison tests which has already been described seemed applicable to 
the objective of obtaining the preferred volume level, and was adopted. 

The talking circuit for the comparison tests was the same as that used 
in the appraisal tests (Fig. 4). In these tests each of the ten selected
volume levels was compared with every other level twice, once with the
level A preceding level B in the presentation, and again in the opposite 
order, B preceding A. However, the two comparisons of like levels were 
not made consecutively, but were interspersed among other combina- 
tions. The comparisons were thus presented to observers in an order that 
was random both as to the volume levels involved in the comparison and
as to the precedence of one value over the other in a combination. As in
the appraisal tests, the magnitudes of received volumes were not
disclosed to observers until after the tests were completed.


TABLE II — ANALYSIS OF PREFERRED-RANGE COMPARISON TESTS

Times That Volume Level in Left Column Was Preferred Over Level in Headings Below

W (Won) denotes that the level in left column was preferred.
L (Lost) denotes that the level in left column was not preferred.
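
A presentation schedule of the kind described for the comparison tests might be generated as in the Python sketch below. The ten levels are hypothetical placeholders, and the reshuffle-until-valid loop is merely one convenient way of keeping the A-then-B and B-then-A presentations apart; the paper does not say how the order was actually drawn up.

```python
import itertools
import random


def has_adjacent_reversal(pairs):
    """True if some comparison (A, B) is immediately followed by (B, A)."""
    return any(first == (second[1], second[0])
               for first, second in zip(pairs, pairs[1:]))


def comparison_schedule(levels_vu, seed=None):
    """Shuffle all ordered pairs of distinct levels, avoiding back-to-back reversals."""
    if len(levels_vu) < 3:
        raise ValueError("need at least three levels to separate A-B from B-A")
    rng = random.Random(seed)
    pairs = list(itertools.permutations(levels_vu, 2))  # (first heard, second heard)
    while True:
        rng.shuffle(pairs)
        if not has_adjacent_reversal(pairs):
            return pairs


# Hypothetical ten levels in VU; not the values used in the tests.
levels = list(range(-9, -39, -3))
schedule = comparison_schedule(levels, seed=3)
```

Ten levels give 90 ordered comparisons, so that each level takes part in 18 of them, nine times as the first sample and nine times as the second.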

The method of analysis of the comparison test data is somewhat unusual.
If each observed comparison between two volume levels is considered
as a game between two teams, the “games won” and “games lost”
may be charted in the same manner as the results usually published for
baseball leagues for a season in which each team plays the same number
of games with every other team. This analysis is shown in Table II, the
right-hand column of which shows the percentage of games “won” by
each of the ten volume levels, that is, the percentage of observed
comparisons in which each of the indicated levels is preferred over the other nine
levels with which it is compared. The values from this column are plotted
as data points for the comparison test distribution shown in Fig. 3.
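
The “games won” tabulation can be sketched in Python as follows. The three-level matrix of counts is invented purely for illustration and is not taken from Table II; the function name is likewise an assumption.

```python
def percent_preferred(wins):
    """Percentage of comparisons won by each level.

    wins[i][j] is the number of times level i was preferred over level j;
    the diagonal is ignored because a level is never compared with itself.
    """
    n = len(wins)
    percentages = []
    for i in range(n):
        won = sum(wins[i][j] for j in range(n) if j != i)
        lost = sum(wins[j][i] for j in range(n) if j != i)
        percentages.append(100.0 * won / (won + lost))
    return percentages


# Hypothetical counts for three levels, ten comparisons per pair of levels.
wins = [
    [0, 3, 1],
    [7, 0, 4],
    [9, 6, 0],
]
result = percent_preferred(wins)
```

Because every comparison has exactly one winner, the percentages always average 50 when each level plays the same number of games.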


IV. APPLICATIONS 


The tests which have been described and the curves which show the 
results of these tests provide fundamental data which, in the authors’ 
opinion, are indicative of a telephone customer’s expectancy with regard 
to hearing and being heard with a minimum of effort on his part. In this 
respect, they are a measure of what is required to satisfy telephone users. 
By themselves, the results of the tests indicate observers’ opinions (as to 
category) of any specific value of received volume, such as might be en- 
countered on any one individual telephone connection. 

While this is of interest, it is more important to the management of an 
operating telephone company to know customers’ reactions to the grade
of service provided over that company’s telephone plant, since this
information may affect decisions on such questions as spending money for
plant improvements. Specifically, the management should know the
percentage of telephone connections over the plant which customers consider
good, the percentage fair, the percentage poor, etc. Such information
may be obtained by combining the opinion distributions described here 
with the estimated distribution of received volumes which customers ac- 
tually obtain in their daily use of the telephone. This is done by integrat- 
ing, over the range of received volumes, the compound probability: (a) 
that the telephone user considers a particular volume “Good” (or “Fair”
or “Poor”), and (b) that he actually receives that volume. It should be
pointed out that the distribution of volumes actually received is, in turn, 
a combination of the distribution of talking volumes, which vary over a 
range of some 30 VU, and the distribution of plant losses from the point 
of the talking volume measurement to the input of a listener’s telephone
set. Fig. 9 is a simple diagram which illustrates this point. It shows a
transmitting telephone and a loop to the central office. Field
measurements of telephone users’ talking volume indicated that the average
volume input into the line leaving the central office was —19 VU, with a
standard deviation of 5.9 db at the time the survey was made.¹ An
analysis made several years ago of the large variety of telephone connections
encountered in the telephone plant indicated that the average
transmission loss between the talker’s central office and the distant listener’s
telephone set was 12 db, with a standard deviation of 3.8 db. The average
received volume at the telephone terminals was therefore —31 VU, with
a standard deviation of 7.0 db.*

Fig. 9 — Circuit showing points of volume measurement and intervening plant losses.
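
The arithmetic behind these figures can be checked with a few lines of Python. The sketch assumes, as the footnote to this paragraph does, that talker volume and plant loss are independent, so that the averages subtract and the standard deviations combine as a root-sum-square. The variable names are illustrative only.

```python
import math

# Figures quoted in the text.
talker_mean_vu, talker_sd_db = -19.0, 5.9   # volume leaving the central office
loss_mean_db, loss_sd_db = 12.0, 3.8        # loss from there to the listener's set

# Independence assumed: means subtract, variances add.
received_mean_vu = talker_mean_vu - loss_mean_db
received_sd_db = math.hypot(talker_sd_db, loss_sd_db)  # sqrt(5.9**2 + 3.8**2)
```

The computed standard deviation is a little over 7.0 db, which rounds to the value given in the text.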

The results of combining the distributions of observers’ opinions and 
actual received volumes have been called “grade of service”. “Grade of
service” curves may be plotted showing the percentage of telephone
connections considered “Good”, “Fair”, “Poor”, etc., against the average of
various received volume distributions, assuming that these distributions 
are allowed to vary in average value, but not in standard deviation. In 
this form they have been found useful in setting objectives for received 
volumes to be provided and for plant losses which will permit attain- 
ment of those received volumes. An example of grade of service curves in 
cumulative categories is shown in Fig. 10. The same data in another 
form, which includes individual categories, appear in Fig. 11. For an
over-all picture of grade of service provided in the plant just described,
assuming that all telephone sets were of the 302 type, the average
received volume may be taken as —31 VU, with a standard deviation of
7.0 db, as indicated in Fig. 9. This average value constitutes one point
on the abscissae of Figs. 10 and 11. It will be noted in Fig. 11 that, if
the average of the distribution of received volumes is —31 VU, the
telephone users may be presumed to consider about 83 per cent of all
telephone connections “Good”, 14 per cent “Fair” and 3 per cent “Poor or
Worse”.

* In this calculation it is assumed that talker volume and plant loss are
independent variables. It is recognized that there may be some small correlation
between them. For instance, Subrizi¹ found a small correlation between talker
volume and distance between talker and listener on very long distance calls. Other
tests to determine correlation have given conflicting results so the correlation is
probably small enough to justify the assumption.
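
The integration of the compound probability can be sketched numerically in Python. The example below treats only the cumulative “Fair or Worse” category, using the opinion-curve parameters quoted later in this section (average —39 VU, standard deviation 4.8 db) and the received-volume distribution of Fig. 9. The function name, integration limits and step size are assumptions made for the sketch.

```python
from statistics import NormalDist

received = NormalDist(-31.0, 7.0)               # received volume, VU (Fig. 9)
fair_or_worse_opinion = NormalDist(-39.0, 4.8)  # opinion curve quoted in the text


def grade_of_service(opinion, received_dist, lo=-90.0, hi=30.0, step=0.05):
    """Midpoint-rule sum of P(category | volume) * P(volume) over the volume range."""
    steps = int(round((hi - lo) / step))
    total = 0.0
    for k in range(steps):
        volume = lo + (k + 0.5) * step
        p_category = 1.0 - opinion.cdf(volume)   # (a) listener rates it Fair or worse
        p_volume = received_dist.pdf(volume)     # (b) density of receiving that volume
        total += p_category * p_volume * step
    return total


fraction = grade_of_service(fair_or_worse_opinion, received)
```

Under these assumptions the sum comes to roughly 17 per cent, in line with the 14 per cent “Fair” plus 3 per cent “Poor or Worse” read from Fig. 11.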

If the distribution of received volumes may be assumed to follow the 
normal law, as in the example just given, the labor of constructing the 
grade of service curves may be greatly reduced by a simple device. While 
not immediately obvious, it can be shown by probability theory that the 
grade of service curves in cumulative categories (Fig. 10) have the same 
average values as the corresponding curves of observers’ opinions (Fig.
8). Also, each of their standard deviations is equal to the root-sum-square
of the standard deviations of the opinion distribution and the
distribution of received volumes. Thus, the “Fair or Worse” curve in Fig. 10
has $\sigma = \sqrt{4.8^2 + 7.0^2}$, or 8.5 db, while its average value, —39 VU,
is the same as that of the corresponding opinion curve on Fig. 8. By
using arithmetic-probability paper, this simple construction of Fig. 10
replaces laborious summations of the compound probabilities. The
summations cannot be avoided, however, if the distribution of received volumes
is other than normal law.

Fig. 10 — Grade of service distributions in cumulative categories provided by
plant of recent past — percentage of telephone connections assignable to various
categories when the average of the received volume distribution has the indicated value.

Fig. 11 — Grade of service distributions provided by plant of recent past —
percentage of telephone connections assignable to various categories when the
average of the received volume distribution has the indicated value.
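
The shortcut described above lends itself to a short Python sketch. It assumes both distributions follow the normal law, so that a cumulative grade of service curve is itself normal, with the opinion curve's average and the root-sum-square standard deviation. The function name and the grid of averages are illustrative choices only.

```python
import math
from statistics import NormalDist


def cumulative_grade_of_service(opinion_mean_vu, opinion_sd_db,
                                received_mean_vu, received_sd_db):
    """Fraction of connections in an "X or Worse" category, by the normal-law shortcut."""
    combined_sd = math.hypot(opinion_sd_db, received_sd_db)  # root-sum-square
    curve = NormalDist(opinion_mean_vu, combined_sd)
    return 1.0 - curve.cdf(received_mean_vu)


# "Fair or Worse" curve of Fig. 10: opinion sd 4.8 db, received-volume sd 7.0 db.
combined_sd = math.hypot(4.8, 7.0)

# One point of the curve for each assumed average received volume (VU).
curve_points = {
    average: cumulative_grade_of_service(-39.0, 4.8, average, 7.0)
    for average in range(-45, -20, 3)
}
```

The curve passes through 50 per cent where the average received volume equals the opinion curve's average of —39 VU, and it falls steadily as the average received volume is raised.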

For applications such as these, some reminders concerning the nature 
of the fundamental data presented here will not be out of place. Ob- 
servers’ opinions as to the category of specific values of received volume 
have been found to vary with (a) the amount of interference present in 
the form of line noise and ambient room noise at the listener’s location, 
(b) the upper cutoff frequency of the telephone message channel and (c) 
the sensitivity of the listening telephone set and receiver, both for speech 
and for noise. As indicated early in this paper, data shown in Figs. 1 and 
2 apply only to conditions of line noise, room noise and transmission 
bandwidth which have been found to cause little or no transmission 
impairment. Line noise and transmission bandwidth are controllable
factors, and the values used are those which have been worked to as
objectives in the Bell System for many years. Room noise is not under control
of the telephone companies, but the value used is one that has been found 
representative of room noise in the average residence or fairly quiet of- 
fice. In addition, the data of Figs. 1 and 2 apply only to receiving tele- 
phone sets of the 302 type, the type in greatest use in the telephone 
plant of the early 1950’s, but now being largely supplanted by the 500 
type. While all the conditions mentioned were reasonably normal a few 
years ago, it must be borne in mind that changes are being made over 
the years which necessitate adjustment of the opinion distributions with 
respect to the VU scale. 

Long-range objectives for received loudness in the Bell System have 
been the subject of much study in recent years. W. K. MacAdam, Trans- 
mission Engineer of the American Telephone and Telegraph Company, 
has stated² that the design objectives of the Bell System might be about
as follows: 

i. A negligible number of calls rated “Poor”.

ii. No more than 5 per cent rated “Fair”.

iii. The balance rated “Good” or “Excellent”.
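
These objectives can be read as a simple check on any grade of service split, as in the Python sketch below. The 1 per cent threshold standing in for “negligible” is an assumption introduced here; the source gives no figure for it. The percentages tested are those read from Fig. 11 for the plant of the recent past.

```python
def check_objectives(fair_pct, poor_pct, negligible_pct=1.0):
    """Compare a grade of service split with the stated design objectives.

    negligible_pct is a hypothetical stand-in for "a negligible number".
    """
    return {
        "poor_negligible": poor_pct <= negligible_pct,
        "fair_within_5_per_cent": fair_pct <= 5.0,
    }


# Plant of the recent past: 14 per cent "Fair", 3 per cent "Poor or Worse".
recent_past = check_objectives(fair_pct=14.0, poor_pct=3.0)
```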

It is evident that the telephone plant of a few years ago did not fully 
meet this objective. However, the picture is altered in the case of the 
future plant. If modern sets of the 500 type having higher receiving sensi- 
tivity are employed, the opinion curves will be found to shift with respect 
to the VU scale by approximately the amount of the sensitivity
difference.* Thus, a value around —23 VU, rather than —19, would represent
the volume level preferred over all others. In addition, it has been found 
that, with the 500-type telephone set, although the average talker vol- 
ume output for a given level of acoustic input is increased, the spread in 
volume is decreased (smaller standard deviation) because there is a cer- 
tain amount of speech compression on higher volumes. Furthermore, 
planned improvements in telephone lines (more precisely, the transmis- 
sion medium between telephone sets) are expected to result in lower 
losses and lower standard deviation. It should be noted that, in the re- 
gion of received volumes in which we are interested, lower standard devi- 
ation contributes to improved grade of service, just as higher average 
values of received volume do. 
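
The effect of a lower standard deviation can be illustrated with a small Python calculation. It reuses the “Fair or Worse” opinion curve quoted for the 302-type set (average —39 VU, standard deviation 4.8 db) purely as an example and assumes the normal law throughout. The reduced standard deviation of 5.0 db is a hypothetical figure, not an estimate from the paper.

```python
import math
from statistics import NormalDist


def fair_or_worse_fraction(received_mean_vu, received_sd_db,
                           opinion_mean_vu=-39.0, opinion_sd_db=4.8):
    """Fraction of connections rated "Fair or Worse" under the normal law."""
    combined_sd = math.hypot(opinion_sd_db, received_sd_db)
    return 1.0 - NormalDist(opinion_mean_vu, combined_sd).cdf(received_mean_vu)


wide = fair_or_worse_fraction(-31.0, 7.0)    # spread of the recent-past plant
narrow = fair_or_worse_fraction(-31.0, 5.0)  # hypothetical smaller spread
```

With the average held at —31 VU, the narrower distribution places fewer connections in the “Fair or Worse” region, since less of its lower tail reaches the low volumes.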

Based on these and other considerations, estimates have been made of 
the grade of service which will be provided by the future telephone plant,
assuming that it includes the 500-type telephone set preponderantly,
and the contemplated line improvements. Distribution curves based on
these estimates are shown in Fig. 12. It is estimated that the average
received volume resulting from the improvements mentioned will be —28
VU, as indicated in this figure. The percentage of calls rated “Good”
would then exceed the objective of 95, while the calls rated “Fair” would
come well within the 5 per cent objective. This is a satisfactory outlook
for the future.

* Assuming the total noise reaching the user’s ear remains unchanged. Because
of improvements in the sidetone circuit this is roughly the case.

Fig. 12 — Grade of service distributions estimated for the future plant —
percentage of telephone connections assignable to various categories when the
average of the received volume distribution has the indicated value.

We then shall have gone about as far in the direction of increasing re- 
ceived volumes as we should. Fig. 12 shows that, at an average received 
volume of —28 VU, the percentage of calls considered “Too Loud” is about 
1.5. Any further increase in received volume might raise the percentage 
“Too Loud” to undesirable values, with a corresponding decrease in the 
percentage of calls rated “Good”. The fact that the “Good” distribution
turns down at higher volume levels is the reason why it is advisable to
set an objective for the average value of the received volume distribution
at a volume level near the peak of the “Good” distribution, around
—28 VU, rather than at the preferred volume level, —23 VU (for the
future plant with 500 sets). Once we have increased the average value
of received volumes to this objective, no further improvement in grade
of service may be expected (should it become desirable) except by
lowering the standard deviation still further.